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Abstract. Since the theorems of Schur and van der Waerden, numerous partition 
regularity results have been proved for linear equations, but progress has been scarce 
for non-linear ones, the hardest case being equations in three variables. We prove 
partition regularity for certain equations involving quadratic forms in three variables, 
£f~) ' showing for example that the equations 16x 2 + 9y 2 = n 2 and x 2 + y 2 — xy — n 2 are 

partition regular, where n is allowed to vary freely in N. For each such problem we 
establish a density analogue that can be formulated in ergodic terms as a recurrence 
property for actions by dilations on a probability space. Our key tool for establishing 
<—< , such recurrence properties is a decomposition result for multiplicative functions which 

is of independent interest. Roughly speaking, it states that the arbitrary multiplicative 
function of modulus 1 can be decomposed into two terms, one that is approximately 
periodic and another that has small Gowers uniformity norm of degree three. 
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O ! 1. Introduction and main results 

1.1. Partition regularity results for quadratic forms. An important question in 
, Ramsey theory is to determine which algebraic equations, or systems of equations, are 

partition regular over the natural numbers. In this article, we restrict our attention 
to polynomials in three variables, in which case partition regularity of the equation 
p(x,y,z) = amounts to saying that, for any partition of N into finitely many cells, 
some cell contains distinct x, y, z that satisfy the equation. 
t> ■ The case where the polynomial p is linear was completely solved by Rado |24j: For 

a,b,c 6 N the equation ax + by = cz is partition regular if and only if either a, b, or 
a + b is equal to c. The situation is much less clear for second or higher degree equations 
and only scattered results are known. A notorious old question of Erdos and Graham [6] 
| is whether the equation x 2 + y 2 = z 2 is partition regular. As Graham remarks in |13j 

"There is actually very little data (in either direction) to know which way to guess". More 
generally, one may ask for which a, b, c 6 N is the equation 

(1) ax 2 + by 2 = cz 2 
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partition regular. A necessary condition is that at least one of a, 6, and a + b equals c, 
but currently there are no a, b, c 6 N for which partition regularity of ([1]) is known. 

In this article, we study the partition regularity of equation JT]), and other quadratic 
equations, under the relaxed condition that the variable z is allowed to vary freely in N. 

Definition. The equation p(x, y,n) = is partition regular in N if for any partition of 
N into finitely many cells, for some n £ N, one of the cells contains distinct x, y that 
satisfy the equation. 

A classical result of Furstenberg-Sarkozy [25] is that the equation x — y = n 2 is 
partition regular. Other examples of translation invariant equations are provided by the 
polynomial van der Waerden Theorem of Bergelson and Leibman [3], but not much is 
known in the non-translation invariant case. A result of Khalfalah and Szemeredi 121 1 is 
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that the equation x + y = n 2 is partition regular. Again, the situation is much less clear 
when one considers non-linear polynomials in x and y, as is the case for the equation 
ax 2 + by 2 = n 2 where a, b € N. It is one of the main goals of this article to produce the 
first positive results in this direction. For example, we show that the equations 

lQx 2 + 9y 2 = n 2 and x 2 + y 2 — xy = n 2 

are partition regular (note that 16x 2 + 9y 2 = z 2 is not partition regular). In fact we 
prove a more general result for homogeneous quadratic forms in three variables. 

Theorem 1.1 (The three squares theorem). Let p be the quadratic form 

(2) p(x, y, z) = ax 2 + by 2 + cz 2 + dxy + exz + fyz 

where a, b are positive, c is negative, and d,e, f are arbitrary integers. Suppose that all 
three forms p(x,0, z), p(0,y,z), p(x,x,z) have non-zero square discriminants. Then the 
equation p(x, y,n) =0 is partition regular. 

The last hypothesis means that the three integers 

A i: =e 2 -4ac, A 2 := / 2 - 46c, A 3 := (e + f) 2 - 4c(a + b + d) 
are non-zero squares. As a special case, we get the following result: 

Corollary 1.2. Let a, b, and a + b be non-zero squares. Then the equation ax 2 + by 2 = n 2 
is partition regular. More generally, if a, b, and a + b + c are non-zero squares, then the 
equation ax 2 + by 2 + cxy = n 2 is partition regular. 

A partition Ci, . . . , C r of N induces another partition Ci, . . . , C r by the following rule: 
x G Cj if and only if x 2 € 6j . Applying Theorem 11.21 for the induced partition we deduce 
non-trivial results even for linear equations: 

Corollary 1.3. Let o, b, and a + b be non-zero squares. Then the equation ax + by = n 2 
is partition regular. 

Although combinatorial tools, Fourier analysis tools, and the circle method have been 
used successfully to prove partition regularity of equations that enjoy some linearity 
features (also for non- linear equations with at least four variables), we have not found 
such tools adequate for the fully non-linear setup we are interested in. Instead, we found 
greater utility to the recently developed toolbox of higher order Fourier analysis that 
relies on inverse theorems for the Gowers uniformity norms and various quantitative 
equidistribution results on nilmanifolds. We give a summary of our proof strategy in the 
next subsections. 

1.2. Parametric reformulation. In order to prove Theorem 1 1.1 1 we exploit some special 
features of the solution sets of the equations involved given in parametric form. In 
particular, we have the following result that is proved in Appendix [Bj 

Proposition 1.4. Let the quadratic form p satisfy the hypothesis of Theorem \l.l\ Then 
there exist £q,£\ positive and £2^3 non-negative integers with £2 7^ £3, such that for every 
k,m,n S N, the integers x = k£§m(m + £\n) and y = k£o(m + £2n)(m + ^n) satisfy the 
equation p(x, y, z) = for some z£N, 

For example, the equation 16x 2 + 9y 2 = z 2 is satisfied by the integers x = km(m + 3n), 
y = kirn + n)(m — 3n), z = k(5m 2 + 9n 2 + Qmn) (replacing m with m + 3n leads to 
coefficients of the announced form) , and the equation x 2 + y 2 — xy = z 2 is satisfied by 
the integers x = km{m + 2n), y = k(m — n)(m + n), z = k(m 2 + n 2 + mn). 

The key properties of the patterns involved in Proposition II. 41 are: (a) they are dilation 
invariant, which follows from homogeneity, (b) they "factor linearly" which follows from 
our assumption that the discriminants A\, A2 are squares, and (c) the coefficient of m in 



UNIFORMITY OF MULTIPLICATIVE FUNCTIONS AND PARTITION REGULARITY 



3 



all forms can be taken to be 1 which follows from our assumption that the discriminant 
A3 is a square. 

Using Proposition [T31 we see that Theorem II .H is a consequence of the following result: 

Theorem 1.5 (Parametric reformulation). Let £q,£i be positive and ^,£3 non-negative 
integers with £2 7^ £3- Then for every partition o/N into finitely many cells, there exist 
k,m,n S N such that the integers k£Qm(m + £±n) and k£o(m + £2n)(m + £371) are distinct 
and belong to the same cell. 

1.3. From partition regularity to multiplicative functions. Much like the transla- 
tion invariant case, where partition regularity results can be deduced from corresponding 
density statements with respect to a translation invariant density, we deduce Theorem 1 1.5 1 
from the density regularity result of Theorem 12.1 1 that involves a dilation invariant density 
(a notion defined in Section \2.1L 

In Section 12.21 we use a multiplicative version of the correspondence principle of 
Furstenberg to recast Theorem 12.11 as a recurrence property for measure preserving ac- 
tions of the multiplicative semigroup N on a probability space (Theorem I2.2p . 

In Section 12.31 we use a corollary of the spectral theorem for unitary operators (see 
identity 0) to transform the recurrence result into a positivity property for an integral of 
averages of products of multiplicative functions (Theorem 12. 5p . It is then this positivity 
property that we seek to prove, and the heavy-lifting is done by a decomposition result 
for multiplicative functions which is the main number theoretic result of this article 
(Theorem ll.6p . Assuming this result (we prove it in Sections [3][6j), the proof of the 
positivity property of Theorem 12.51 is completed in Section 12.61 The reader will also find 
there a detailed sketch of our proof strategy for this step. We discuss the decomposition 
result next. 

1.4. Multiplicative functions and Gowers uniformity. Our proof of the positivity 
property mentioned above necessitates that we decompose an arbitrary multiplicative 
function into two components, one that we can easily control, and another that behaves 
randomly enough to have a negligible contribution. For our purposes, randomness is 
measured by the Gowers uniformity norms. Before proceeding to the precise statement of 
the decomposition result we start with some informal discussion regarding the uniformity 
norms and the uniformity properties (or lack thereof) of multiplicative functions. 

Gowers uniformity. We recall the definition of the U 2 and £7 3 -Gowers uniformity norm 
from [9]. Here and later, for a function / defined on a finite set A we write 

E xeA f(x) : =-L]T/(x). 

' ' xeA 

Definition (Gowers uniformity norms). Given N E N and /: Z^v —> C, we define the 
U 2 (jLn)- Gowers norm of / as follows 

\\f\\U(z N ) = ®hez N Kez N f(n + h) • /(n)| 2 ; 

and the C/ 3 (Zjv)- Gowers norm of / as follows 

|Enez„/(n + h + h 2 ) ■ f(n + hi) ■ f(n + h 2 ) ■ f(n)\ 2 . 

In an informal way, having a small [/ 2 -norm is interpreted as a property of U 2 - 
uniformity, and having a small C/ 3 -norm as a stronger property of [7 3 -uniformity. Since 
II/IIc/ 3 (Zjv) — II/II(/ 2 (Zjv) f° r ever y function / on Z^r, [7 3 -uniformity implies C/ 2 -uniformity. 

We recall that the Fourier transform of a function / on Z^r is defined by 

f(0 := E neZjv /(n) e(-n£/A0 for £ G Z N , 
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where, as is standard, e(cc) := exp(27rix). A direct computation gives the following 
identity that links the C/ 2 -norm of a function / on Zjy with its Fourier coefficients: 

(3) Wf\\u H z N ) = £ l^)l 4 - 

Using this and Parseval's identity, we see that the ^-uniformity of a function can be 
interpreted as the function having small Fourier coefficients, that is, small correlation 
with linear phases. We would like to stress though, that this property does not guarantee 
C/ 3 -uniformity; a function bounded by 1 may have small Fourier coefficients, but large 
?7 3 -norm. In fact, eliminating all possible obstructions to ^-uniformity necessitates the 
study of correlations with all quadratic phases e(n 2 a + n/3) and the larger class of 2-step 
nilsequences of bounded complexity (see Theorem 14. 2p . 

Multiplicative functions. In this article, we slightly abuse standard terminolog>|]] and 
define multiplicative functions as follows: 

Definition (Multiplicative functions). A multiplicative function is a function x' N — > C 
that satisfies x( mn ) = x{ m )x( n ) f° r an m,n € N. We denote by A4 the family of 
multiplicative functions of modulus 1. 

Note that elements of Ai are determined by their values on the primes. 

It so happens that several multiplicative functions do not share the strong uniformity 
properties of the Mobius function established in |18| . The next example illustrates some 
simple but very important obstructions to uniformity. 

Example (Obstructions to uniformity). Let x £ A4 be defined by = — 1 and x(p) = 
1 for every prime p ^ 2. Equivalently, x(2 m (2/c + 1)) = (— l) m for all k,m > 0. Then 
^i<n<NX( n ) = 1/3 + o(l) and this non-zero mean already gives an obstruction to U 2 - 
uniformity. But this is not the only obstruction. Indeed, we have Ei< n <Ar( — l) n x(n) = 
—2/3 + o(l), and this implies that x ~ 1/3 does not have small C/ 2 -norm. 

Examples similar to the previous one show that normalized multiplicative functions 
can have significant correlation with periodic phases and thus this is an obstruction to U 2 - 
uniformity that we should take into account. However, it is a non-trivial fact that plays a 
central role in this article, that correlation with periodic phases are, in a sense to be made 
precise later, the only obstructions not only to J7 2 -uniformity but also to -uniformity 
of multiplicative functions. For C/ 2 -uniformity this is already indicated by an old result of 
Daboussi [5], which states that if a is irrational, then sup^g^ \^i<n<NX( n ) e(na)| — > 
as N — > oo. The proof of this result was later simplified and extended by Katai |20] (for 
good quantitative versions of these results see [H [23]), and it is a simple orthogonality 
criterion obtained in this article of Katai (see Lemma I3.2p that is going to be a key 
number theoretic input for this article. 

1.5. [/^-Decomposition of multiplicative functions. We proceed now to the formal 
statement of our main decomposition result. We are given positive integers ^1,^2,^3 and 
these are considered fixed through this article. From this point on we let £ be the number 

Also, given N £ N we let 

[N]:={1,...,N}. 

It is often easier to work on a cyclic group rather than an interval of integers, as this 
makes Fourier analysis tools more readily available. In order to avoid roundabout issues, 
we introduce the following notation. Given iV G N, we denote by the smallest prime 



The functions we call multiplicative are often called completely multiplicative as opposed to functions 
like the Mobius that are called multiplicative. 
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that is greater than WIN. By Bertrand's postulate we have N < 20£N. For every 
multiplicative function x £ .M and every N £ N, we denote by xn the function on Z^, 

or on [N], defined by 

j X (n) ifne[N}; 
I otherwise. 



(4) Xn{u) 



The domain of xn will be each time clear from the context. Working with the truncated 
function X' lrjvi > ra ther than the function Xi ls a technical maneuver and the reader will 
not lose much by ignoring the cutoff. We should stress that from this point on, Gowers 
norms are going to be defined and Fourier analysis is going to happen on the group Z^ 
and not on the group Zjv- 

We can now state the main decomposition result that will be used below in the proof 
of the theorems of partition regularity. Its essence is that the restriction of an arbitrary 
multiplicative function x £ -M on a finite interval [N] can be decomposed into three 
pieces, one that is approximately periodic, one that has small L 1 -norm, and one that 
has extremely small [7 3 -norm. In addition, the structured component enjoys some very 
important features, for example, it is a convolution product of xn with a positive kernel 
that is independent of x an d its approximate period is bounded by a constant that does 
not depend on x or on N. 

Definition. By a kernel on we mean a non- negative function with average 1. 

Theorem 1.6 (Strong decomposition on average for the C/ 3 -norm). For every positive 
finite measure v on the compact group Ai of multiplicative functions having modulus 1, 
every function F:NxNx IR + —> M + , every e > 0, and every sufficiently large N £ N, 
depending only on F and e, there exist positive integers Q and R that are bounded by a 
constant which depends only on F and e, such that, for every the function xn 

admits the decomposition 

XN{n) = XN,s{ n ) + XN,u(n) + XN,e( n ) f or ever V n G Z AH 
where XN,s, XN,u, and XN,e satisfy the following properties: 

(i) xn,s = XN* ipN,l and xn,s + XN,e = XN* ipN,2, where ip N>1 and ip N<2 are kernels 
on Z^ that do not depend on x> and the convolution product is defined in Z^; 

R ~ 

(ii) \xN,s(n + Q) — XN,s(n-)\ < -~ for every n £ Z^, where n + Q is taken mod N; 

(iii) WxnJus^) < F{Q:Rj£ y 

(iv) E neZ „ J M \xnM\ du(x) < e. 

Remarks. (1) For arbitrary bounded sequences, decomposition results with similar flavor 
have been proved in |10 | II 126 } 127). but working in this generality necessitates the use 
of structured components that do not satisfy the strong rigidity condition of Property (jn]). 
An additional important feature of our result is that the structured component is defined 
by a convolution product with a kernel that is independent of x- All these properties 
play an important role in the derivation of the combinatorial result in Section [2j 

Extensions of Theorem 1 1 . 6 1 that give information about the C/ fc -norm of the component 
Xn,u f° r k > 4 appear to be significantly more complicated, and as they are not needed 
for the purposes of this article will not be established here. 

(2) We only plan to use this theorem for the function F(x, y, z) = cx 2 y 2 /z 4 where c is 
a constant that depends on £ only. Restricting the statement to this function does not 
simplify our proof though. 

(3) The bound (|iv|) is not uniform in x as i n P ar t (fiu]) of Theorem 16.11 below. We do 
not know if this weakening on the bound is needed or is an artifact of our proof. 
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(4) It is a consequence of Property ([!]) that for fixed F, N, e, u, the maps X ^ Xs,X ^ 
Xu,X^ Xe are continuous, and \\Xs\\l^(Z^) < 1, ||XulU°°(z#) < 2 > ||Xe|U«>(2 if ) < 2 - 

Most of the work in the proof of Theorem 11.61 goes into verifying the decomposition 
result of Theorem 16.11 that gives weaker bounds on the uniform component of the decom- 
position. Two ideas that play a prominent role in its proof, roughly speaking, are: 

(a) A multiplicative function that has C/ 2 -norm bounded away from zero correlates 
with a linear phase that has frequency close to a rational with small denominator. 

(b) A multiplicative function that has ?7 3 -norm bounded away from zero necessarily 
has t/ 2 -norm bounded away from zero. 

The proof of (a) uses classical Fourier analysis tools and is given in Section [3l The 
key number theoretic input is the orthogonality criterion of Katai stated in Lemma 13.21 

The proof of (b) is much harder and is done in several steps using higher order Fourier 
analysis machinery. In Section0we combine Katai's criterion with a quantitative equidis- 
tribution result on nilmanifolds of Green and Tao (Theorem 15. ip to study the correlation 
of multiplicative functions with nilsequences. These results are then combined in Sec- 
tion [6] with modifications of the C/ 3 -inverse theorem (Theorem I4.2[) and the factorization 
result (Theorem 14. 4p of Green and Tao, to conclude the proof of the weak decomposition 
result of Theorem 16.11 We defer the reader to Section 16.11 for a more detailed sketch of 
our proof strategy. 

Upon proving the weak decomposition result of Theorem 16. 11 the proof of Theorem 1 1.6 1 
consists of a Fourier analysis energy increment argument, and avoids the use of finitary 
ergodic theory and the use of the Hahn-Banach theorem, tools that are typically used 
for other decomposition results (see |101 HH ll~2" l I16 [ 127]). 

1.6. Further directions. Theorem [TTT] establishes that the equation 
(5) ax 2 + by 2 = cn 2 

is partition regular provided that all three integers ac, be, (a + b)c, are non-zero squares. 
Two interesting cases, not covered by the previous result, are the following: 

Problem 1. Are the equations x 2 + y 2 = n 2 and x 2 + y 2 = 2n 2 partition regular^ 

Let us explain why we cannot yet handle these equations using the methods of this 
article. The equation x 2 + y 2 = 2z 2 has the following solutions: x = k(m 2 — n 2 + 2mn), 
y = k(m 2 — n 2 — 2mn), z = k(m 2 + n 2 ), where k,m,n G Z. The values of x and y 
do not factor in linear terms, which leads to the major obstacle of not being able to 
establish uniformity estimates analogous to the ones stated in Lemma [2. 71 The equation 
x 2 + y 2 = z 2 has the following solutions: x = k(m 2 — n 2 ), y = 2kmn, z = k(m 2 + n 2 ). 
In this case, it is possible to establish the needed uniformity estimates but we are not 
able to carry out the argument of Section 12.61 in order to prove the relevant positivity 
property (see footnote [3] below for more details). 

A set E C N has positive (additive) density if lim sup^r^^ \E fl [7V]|/iV > 0. It 
turns out that the equations of Corollary 11.21 have non-trivial solutions on every infinite 
arithmetic progression, making the following statement plausible: 

Problem 2. Does every set BcN with positive density contain distinct ijfN that 
satisfy the equation lQx 2 + 9y 2 = n 2 for some n £ N ? 

We say that the equation p(x,y,z) = 0, p G Z[x,y,z], has no local obstructions if for 
every infinite arithmetic progression P, there exist distinct x,y,z G P that satisfy the 

2 Note that the equation x 2 + y 2 — 3n 2 does not have solutions in N. Furthermore, the equation 
x 2 + y 2 — 5n 2 has solutions in N but it is not partition regular. Indeed, if we partition the integers in 
6 cells according to whether their first non-zero digit in the 7-adic expansion is 1, 2, . . . , 6, it turns out 
that for every n £ N the equation has no solution on any single partition cell. 
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equation. For example, the equations x 2 + y 2 = 2z 2 and 16x 2 + 9y 2 = 25z 2 have no local 
obstructions. 

Problem 3. Let p E l*[x,y,z] be a homogeneous quadratic form and suppose that the 
equation p(x, y,z) = has no local obstructions. Is it true that every subset of N of 
positive density contains distinct x,y,z that satisfy the equation? 

As we mentioned before, there are no values of a, b, c E N for which the equation 
(6) ax 2 + by 2 = cz 2 

is known to be partition regular and the condition "at least one of a, b, and a + b equals 
c" is necessary for partition regularity. 

Problem 4. Are there a, b, c E N for which equation (|6|) is partition regular? 

Notable examples are the equations x 2 + y 2 = z 2 and x 2 + y 2 = 2z 2 . In [19] it is 
conjectured that the second equation is partition regular. 

1.7. Notation and conventions. We denote by N the set of positive integers. 
For JVENwe denote by [N] the set {1, . . . , N}. 

For a function / defined on a finite set A we write E xg ^/(x) = YlxeA f(x). 
With A4 we denote the set of multiplicative functions x : N — > C with modulus 1. 
Throughout, we assume that we are given £\,£2,£3 E N and we set £ = £\ + £2 + £3- 
A kernel on Z^r is a non-negative function on Zjv with average 1. 

For N E N we let N be the smallest prime that is larger than W£N (then N < 20£N). 

Given x £ -M. and N E N we let xn '■ [N] — > C be defined by \N = X " ^-[N]- The domain 
of xn is sometimes thought to be Zj^. 

For technical reasons, throughout the article all Fourier analysis happens on and all 
uniformity norms are defined on Z^. 

If x is a real, e(x) denotes the number exp(2nix), \\x\\ denotes the distance between x 
and the nearest integer, \x\ the largest integer smaller or equal than x, and \x] the 
smallest integer greater or equal than x. 

Given s E N we write k = (ki, . . . , k s ) for a point of Z s and ||k|| = \k\\ + ■ ■ ■ + \ k s \. For 

u = (ui, . . . ,u s ) E T s , we write k • u = k\U\ + • • • + k s u s . 

Let / be a function on a metric space X with distance d. We define 

11/11 iff m 1 - f{y)\ 

\\f Lip(X) = SUp /(X + SUp r . 

xex x,yex d(x,y) 

There is a proliferation of constants in this article and our general principles are as follows: 
The constants £\,£2, £3, are considered as fixed throughout the article, and quantities 
depending only on these numbers are considered as universal constants. The letters £q, 
£, and c, c\, C2, are reserved for constants of this type independently of whether they 
represent small or large quantities. Quantities that depend on one or more variables are 
denoted by Roman capital letters C, D, K, ... if they represent large quantities, and by 
low case Greek letters 7, 8, e, . . . if they represent small quantities. It will be very clear 
from the context when we deviate from these rules. 

2. Proof of partition regularity assuming the decomposition result. 

The goal of this section is to prove the combinatorial Theorem 11.51 assuming the de- 
composition result of Theorem 11.61 (this is proved in Sections [3}[6]) . We begin by giving 
three successive reformulations of Theorem 11.51 
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2.1. Reduction to a density regularity result. We first recast Theorem 11.51 as a 
density regularity statement for dilation invariant densities on the integers. 

Definition (Multiplicative F0mer sequence). The sequence ($at)jv 6 j^ of finite subsets of 
N is a multiplicative F0lner sequence if for every n G N 

\n<S> N A$ N \ 

Inn : ; = (J 

N^+oo |<Pjv| 

where n§N := {nx: x G <&n}- 

The sequence (^n)n^n defined by &n '■= {Pi 1 • • 'V^ 1 '■ < k\, . . . , fc^v < N}, where 
P\,P2, ••• is the sequence of primes, serves as a typical example. To a given multiplicative 
F0lner sequence we associate a notion of multiplicative density as follows: 

Definition (Multiplicative density). The multiplicative density d mn \t{E) of a subset E 
of N (relatively to the multiplicative F0mer sequence (3>j\r)) is defined as 

d m ult(E) := hmsup — — - — . 

N->-+oo \^N\ 

We remark that the multiplicative density and the additive density are non-comparable 
measures of largeness. For instance, the set of odd numbers has zero multiplicative 
density with respect to any multiplicative F0lner sequence, as has any set that omits all 
multiples of some positive integer. On the other hand, it is not hard to construct sets 
with multiplicative density 1 that have additive density (see for instance [JJ). 

An important property of the multiplicative density, and the reason we work with this 
notion of largeness, is that for every BcN and n G N, we have 

dmn\t(nE) = d mu \ t (E) = d mu i t (n _1 £;), where n~ x E := {x G N: nx G E}. 

Since a multiplicative density is clearly sub-additive, any finite partition of N has at 
least one cell with positive multiplicative density. Hence, Theorem 11.51 follows from the 
following stronger result: 

Theorem 2.1 (Density regularity). Let A) 5 ^i>^2i^3 be as in Proposition \T^\ Let J?cN 
be a set with positive multiplicative density. Then there exist k,m,n G N such that the 
integers kl^mim + i\n) and klo(m + £2n)(m + i^n) are distinct and belong to E. 

In fact, we show that for a set of (m, n) G N 2 of positive (additive) density the asserted 
property holds for a set of k G N of positive multiplicative density. 

2.2. Reduction to a recurrence results for actions by dilations. Our next goal is 
to reformulate the density statement of Theorem 12. II as a recurrence statement in ergodic 
theory. 

Definition. An action by dilations on a probability space (X,B,/jl) is a family (T n ) ne ^ 
of invertible measure preserving transformations of (X,B,f/,) that satisfy 

T\ := id and for every m,n£N, T m oT n = T mn . 

We remark that an action by dilations on a probability space (X, B, p) can be extended 
to a measure preserving action (T r ) rg Q+ of the multiplicative group Q + by defining 

T a i h :=T a T- x for all a, b G N. 

We will use a multiplicative version of the (additive) correspondence principle of 
Furstenberg [8] . Its proof can be found in [JJ . 

Multiplicative correspondence principle. Let E be a subset ofN. Then there exist 
an action by dilations (T n ) nG N on a probability space (X,B,p), and a set A G B with 
p(A) = d mu it(E), such that for every k G N and for all n\,U2, ■ ■ ■ ,n/- G N, we have 

d mult {n^E n n^E n • • • D n^E) > n{T~^A n T^A n • • • D T~^A). 
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Using this correspondence principle we can recast Theorem 12.11 as a recurrence state- 
ment in ergodic theory regarding actions by dilations. 

Theorem 2.2 (Recurrence). Let be as in Proposition \1.4\ Let (T n ) n ^ be an 

action by dilations on a probability space (X, B, fx). Then for every A G B with [i(A) > 7 
there exist m,n G N, such that the integers m(m + l\ri) and (m + lin){m + l%n) are 
distinct, and 

u(T~} sAnTr\ e w .A)>0. 

f\ m(m+t\n) (m+l.2n)(m+lzn) ) 

Proof of Theorem \2.1\ assuming Theorem \2.2\ Let E C N have positive multiplicative 
density. Let the probability space (X,B,/J,), the action (T n ) n ^, and the set A G B 
be associated to E by the previous correspondence principle. Let £q, £\^2^3 be as in 
Proposition ll.4i In order to show that there exist integers k,m,n G N satisfying the 
conclusions of Theorem 12.11 it suffices to show that there exist m, n G N so that the 
integers £om(m + £\n) and £o{m + £in){m + £%n) are distinct, and satisfy 

Since /x is T io -invariant, the left hand side equals ^{T'^^AnT^^^^A) , and 
the existence ofm,n £ N satisfying the asserted properties follows from Theorem 12. 21 □ 

The degenerate case where £\ = £2 (and similarly if £\ = £3 or £2^ = 0) is rather 
trivial. Indeed, we are then reduced to establishing positivity for fi^T^A D T^_^ n A) . 

Letting m = £3, n = 2 n> — 1, we further reduce matters to showing that ^(AnT^ A) > 
for some n' G N, and this follows from the Poincare recurrence theorem applied to T2. 
Therefore, in the rest of this article, we can and we will assume that ^1,^2 1^3 are distinct 
positive integers. In this case, our goal is to show that the set of pairs (m, n) satisfying 
the conclusion of Theorem 12.21 has positive (additive) density in N 2 : 

Theorem 2.3 (Recurrence on the average). Let £±,£2, £3 G N be distinct. Let (T n ) nG ^ 
be an action by dilations on a probability space (X,B,fi). Then for every A G B with 
fJ.(A) > we have 

K£ E ( m '") ee ^ ^( T m(m+e 1 n) AnT [m+E 2 n)(m+e 3 n) A ) > 
where N = {(m,n) G [N] X [N] : 1 < m + An < N fori = 1,2,3}. 

Remark. In fact, we prove more: the liminf is greater or equal than a positive constant 
that depends only on the measure of the set A. 

Proof of Theorem \2.2\ assuming Theorem \2.3l It suffices to notice that for N sufficiently 
large we have |Ojv| > C\N 2 and the cardinality of the set of pairs (m,n) G @n that 
satisfy m(m + £\ri) = (m + £2n){m + £%n) is bounded by C2N for some constants c\ and 
C2 that depend only on £\,£2, £3- D 

2.3. Reduction to a positivity property for multiplicative functions. Next, we 
show that Theorem 12.31 is equivalent to a positivity property for multiplicative functions. 

Recall that the set A4 consists of all multiplicative functions of modulus 1. When en- 
dowed with the topology of pointwise convergence, Ai is a compact (metrizable) Abelian 
group. If {p\,P2, • • • } denotes the set of primes, then a multiplicative function \ is deter- 
mined by its values on the primes. The map x ^ {x{Pn))n&n is an isomorphism between 
the groups M and T N . The space of multiplicative functions Ai is the dual group of the 
multiplicative group Q + , the duality being given by 

x(m/n) = x( m )x( n ) f° r every \ £ M. and every m,n G N. 

Recall that an action (T n ) ne psj by dilations on a probability space (X, B, /i) extends to 
a measure preserving action of the multiplicative group Q + on the same space. Since Ai 
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is the dual group of this countable Abelian group, by the spectral theorem for unitary 
operators, for every function / € L 2 (^l) there exists a positive finite measure v on the 
compact Abelian group A4, called the spectral measure of f, such that, for all m,n G N, 

(7) / T m f-T n fdn= T m/n f -J djj, = x(m/n)du(x) = x(m) x{n) dv(x)- 
JX J JM JM 

Let A £ B. Letting / = 1a i n © an d using the multiplicativity of elements of M, we 

get for all m, n S N that 

^ T ^\m+i in ) A n T [m+e 2 n)(m+i s n) A ) = / x{m{m + l x n)) x(m + £ 2 n)(m + £ 3 n)) dv{ X ) 

J M 

x{ m )x( m + ^i n ) x( m + ^2 n ) x(w + £ 3 n)dv{x)- 
From this identity we deduce that Theorem 12.31 is equivalent to the following result: 

Theorem 2.4 (Spectral reformulation of recurrence result I). Let l x , £ 2 ,£ 3 GNbe distinct 
and (T n ) ng pj be an action by dilations on a probability space (X,B,/J,). Then for every 
A 6 B with fJ.(A) > 0, writing v for the spectral measure of we have 

(8) lim inf / E (m n)6e N x{m)x{m + t\n) x{m + i 2 n) x{m + £ 3 n) dv{x) > 
where is as in Theorem \2.3l 

Remark. An alternate (and arguably more natural) way to try to prove Theorem 12.21 
is to replace the additive averages in Theorem 12.41 with multiplicative ones. Upon doing 
this, one is required to analyze averages of the form 

^ m ,na^ N x{m(m + tin)) x((m + £ 2 n)(m + £ 3 n)) 

where ($7v)iVeN is a multiplicative F0lner sequence in N and x £ -M. Unfortunately, we 
were not able to prove anything useful for these multiplicative averages, although one 
suspects that a positivity property similar to the one in ([5]) may hold. 

Next, for technical reasons we reformulate Theorem 12.41 as a positivity property in- 
volving averages over Z^. This is going to be the final form of the recurrence statement 

that we aim to study. Recall that N was defined in Section 11.51 and the functions xn on 
were defined by in the same section. 

Theorem 2.5 (Spectral reformulation of recurrence result II). Let (T n ) n ^ be an action 
by dilations on a probability space (X,B,fj>). Then for every A E B with ^(A) > 0, 
writing v for the spectral measure of 1a, we have 

(9) liminf / E mtn& l m (n)xN(m)XN(m+£ 1 n)x N (m+£ 2 n)x N (rn+£ 3 n)di'(x)>0, 

N^-oo J M 

where in the above average the expressions m + £{n can be considered as elements of Z 
or without affecting the value of the average. 

We check that Theorems 12.41 and 12.51 are equivalent. Using the definition of the set 
Oat given in Theorem 12.31 we can rewrite the averages that appear in the statement of 
Theorem 12.41 as follows 

^{rn,n)ee N x{m)x{m + hn) x{m + £ 2 n) x{m + £ 3 n) = 




The value of the last expression remains unchanged when we replace each term m + £{n 
by m + £in mod N. Using this identity and that cN 2 < |0j\r| < N 2 for some positive 
constant c that depends only on £, we get the asserted equivalence. 
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2.4. Some estimates involving Gowers norms. Next we establish two elementary 
estimates that will be used in the sequel. The first one will be used in Section [6j 

Lemma 2.6. Let N be prime. For every function a: Z^r — > C and for every arithmetic 
progression P contained in the interval [N], we have 

|E n6 [jv]lp(n) • a(n)| < ci||a||[/2 (Zjv ) 

for some universal constant c\ . 

Proof. Since N is prime, the [7 2 -norm of a function on Zjv is invariant under any change 
of variables of the form x \— > ax + b, where a, b £ N and a / mod N. By a change of 
variables of this type, we are reduced to the case that P is an interval {0, . . . , m} with 
< m < N, considered as a subset of Z^r. A direct computation then shows that 

117(01 ^mmr ^ 2 N-o for t- 1 --*- 1 - 

and as a consequence 

Il 1 p(0lli4/3([AT]) < Cl 

for some universal constant c\. Using this estimate, Parseval's identity, Holder's inequal- 
ity, and identity ([3]), we deduce that 

|E ne [AT]lp(n) • a(n)\ = \ ^ l P (f) • a(£)| < ci • ( ^ |a(£)| 4 ) 1 = d \Mu*{i N )- D 

The next estimate is key for the proof of Theorem 12.51 It is the reason we seek for a 
J7 3 -decomposition result in this article. 

Lemma 2.7 (C/ 3 -uniformity estimates). Let ai, i = 0,1,2,3, be functions on Z^ with 
1 1 £00(2^) < 1 and £±,£ 2 ,£ 3 £ N be distinct. Then there exists a constant c 2 , depending 
only on £, such that 

\E m ,nez^[N](n)-a (m)-ai(m+£in)-a2(m+£ 2 n)-a 3 (m+£ 3 n)\ < c 2 min ([|aj||i/3( Z j) 1 / 2 . 

Proof. We first reduce matters to estimating a similar average that does not contain the 
term l^](n). Let r be an integer that will be specified later and satisfies < r < N/2. 
We define the "trapezoid function" <j) on Zj^ so that 4>(0) = 0, (p increases linearly from 
to 1 on the interval [0, r], cj>(n) = 1 for r < n < N — r, <fi decreases linearly from 1 to 
on [N -r,N], and <j){n) = for N < n < N. 

The absolute value of the difference between the average in the statement and 

E m ,n G z^<M™) • ao(m) ■ a\(m + £\n) ■ a 2 (m + £ 2 n) ■ a 3 (m + £ 3 n) 



is bounded by 2r/N. 

Moreover, it is classical that 

E,^, ciV 

for some universal constant c that depends only on £, and thus 
E m ,„ e z-0(n) • a (m) • ai(m + ^n) • a 2 (m + £ 2 n) • a 3 (m + ^ 3 n) 



iV 

cN 

max 



< 



E m,ti6Z^e(n(/JV) • a (m) ■ oi(m + fin) • a 2 (m + ^ 2 n) • a 3 (m + ^ 3 ra) 
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Furthermore, notice that upon replacing do(re) with ao(n)e(£*n£ / 'N) and a\{n) with 
ai(n)e(— £*n£/N), where £\£\ = lmod N, the L^-norm of all sequences remains un- 
changed, and the term e(n£/N) disappears. We are thus left with estimating the average 



E m ,ngz^ao("i) • ai(m + hn) ■ a 2 (m + £ 2 n) • a 3 (m + £ 3 n), 
which is known (see for example [27} Theorem 3.1]) to be bounded by 

U := min ||a.,-|lr/3fx~v 

Combining the preceding estimates, we get that the average in the statement is 
bounded by 

2r cN 
+ U. 

N r 



Assuming that U ^ and choosing r = [VUN/(40£)\ (then r < N/(A0£) < N/2) gives 
the announced bound. □ 

2.5. A positivity property. We derive now a positivity property that will be used in 
the proof of Theorem 12.51 in the next subsection. Here we make essential use of the fact 
that the spectral measure v is associated to a non-negative function on X, and also that 
the function that defines the convolution product is non-negative. 

Lemma 2.8 (Hidden non-negativity). Let the action by dilations (T n ) n ^ on the prob- 
ability space (X,B,fi), the subset A of X, and the spectral measure v on M be as in 
Theorem \2. 51 Let ip be a non-negative function defined on Z^. Then 

(XN * V0( n i) • (XN * ip){n 2 ) ■ (Xn * V0( n 3) • (Xn * i>)(nA) dv(x) > 

M 

for every m, 712,713,714 G Z^. 

Proof. The convolution product xn * ip is defined on the group Z^ by the formula 

(XN * VOW = Efeez-V'Cn - k) ■ XN(k). 

It follows that for every n G [N] there exists a sequence {a n {k)) k< ^i~ of non-negative 
numbers that are independent of x, such that for every x£JWwe have 



(XN*tp)(n)= a n(k)x(k)- 



N 



The left hand side of the expression in the statement is thus equal to 
1 



fcl,fc2,fc3,fc4GZjy 4=1 



Y] Warnih) I x(h) ■ x(h) ■ x(h) ■ X(h)du(x) = 

r~f. ^ Jm 

4 f 

^2 W a niih) I Xihfa) ■ x(hh)dv(x) 



ki,k2,k3,hA&fj 4=1 



4 r 



where the last equality follows from equation 0. This expression is non- negative since 
the function 1 A is non-negative, completing the proof. □ 
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2.6. Proof of Theorem 12.51 assuming Theorem 11.61 We start with a brief sketch 
of our proof strategy. Roughly speaking, Theorem 11.61 enables us to decompose the 
restriction of an arbitrary multiplicative function on a finite interval into three terms, a 
close to periodic term, a "very uniform" term, and an error term. In the course of the 
proof of Theorem 12.51 we study these three terms separately. The order of the different 
steps is important as well as the precise properties of the decomposition. First, we 
show that the uniform term has a negligible contribution in evaluating the averages in 
(|9l). To do this we use the uniformity estimates established in Lemma 12.71 It is for 
this part of the proof that it is very important to work with patterns that factor into 
products of linear forms in two variables, otherwise we have no way of controlling the 
corresponding averages by Gowers uniformity norms. At this point, the error term is 
shown to have negligible contribution, and thus can be ignored. Lastly, the structured 
term Xs is dealt by restricting the variable n to a suitable sub-progression where each 
function Xs gives approximately the same value to all four linear formsj^ it then becomes 
possible to establish the asserted positivity. In fact, the restriction to a sub-progression 
step is rather delicate, as it has to take place before the component Xe is eliminated (this 
explains also why we do not restrict both variables m and n to a sub-progression) , and in 
addition one has to guarantee that the terms left out are non-negative, a property that 



We now enter the main body of the proof. Recall that ^i,^2>^3 £ N are fixed and 
distinct and that £ = l\ +£2 +^3- We stress also that in this proof the quantities m + ^n 
are computed in Zjy, that is, modulo N. 

Let the action by dilations (T n )„ e N on the probability space (X,B,n), the set A £ B 
with fJ,(A) > 0, and the spectral measure v of / = 1^, be as in Theorem 12.51 We let 



where C3 and C4 are positive constants that will be specified later, what is important is 
that they depend only on £. Our goal is for all large values of N (how large will depend 
only on 5) to bound from below the average 



A(N) := / E m>neZ ^l [Ar] (n) • XN(m) • + hn) • XaK™ + hn) ■ XN( m + hn) dv(x). 



We start by applying the decomposition result of Theorem 11.61 taking as input the 
spectral measure is, the number s, and the function F defined above. Let 



be the numbers provided by Theorem 11.61 We recall that Q and R are bounded by a 
constant that depends only on 5. From this point on we assume that N is sufficiently 
large, depending only on 5, so that the conclusions of Theorem 11.61 hold. To ease the 
notation a bit, we omit the subscript N when we use the functions XN,s,XN,u,XN,e 
provided by Theorem 11.61 and for x £ A4, we write 



This coincidence of values is very important, not having it is a key technical obstruction that stops 
us from handling equations like x 2 + y 2 — n 2 . Restricting the range of both variables m and n does not 
seem to help either, as this creates problems with handling the error term in the decomposition. 

4 In a sense, our approach follows the general principles of the circle method. Each multiplicative 
function is decomposed into two components, with Fourier transform supported on major arc and minor 
arc frequences. The contribution of the "major arc component" is further analyzed to deduce the asserted 
positivity. The "minor arc component" is shown to have negligible contribution, and this step is the 
hardest, it is done using higher order Fourier analysis tools in the course of proving Theorem 11.61 





e := c^S 2 and F(x,y,z) = cl 




Q := Q(F, N, e, v) = Q{N, 8,u), R := R{F, N, e, v) = R(N, 8, v) 



XN{n) =Xs{n) + Xu(n) + Xe(n), n£Z 
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for the decomposition that satisfies Properties (ji|)- (jiv|) of Theorem 11,61 

Next, we use the uniformity estimates of Lemma [2.71 in order to eliminate the uniform 
component \ u from the average A(N). We let 

Xs,e = Xs + Xe 

and 



Ai(N) : = / E mtn& -l [N] (n)-XsA m )-XsA m + ^ n )-XsA m + i2n y^sA m + i 3n) dv{x). 
J M 

Using Lemma l2~Tl Property (JTJ) of Theorem [TTUJ and the estimates |x7v( n )l — 1) IXs,e(^)| < 
1 for every n E Z^, we get that 

(10) \A(N) - A 1 (N)\ < 4C2 

F(Q,R,e)2 

where c 2 is the constant provided by Lemma [2, 71 and depends only on t. 

Next, we try to eliminate the error term Xe- But before doing this, it is important to 
first restrict the range of n to a suitable sub-progression; the utility of this maneuver will 
be clear on our next step when we estimate the contribution of the leftover term Xs- We 
stress that we cannot postpone this restriction on the range of n until after the term Xe 
is eliminated, if we did this the contribution of the term Xe would swamp the positive 
lower bound we get from the term Xs- We let 

By Property ([!]) of Theorem 11.61 Lemma 12.81 applies to Xs,e- Note that the integers Qk, 
1 < k < rjN, are distinct elements of the interval [N]. It follows that 

X / 1 [JV]( n ) ■XsA m ) ■XsA m + ^ n ) ■XsA m + ^ n ) ■X s ,e(m + £ 3 n)dv(x) > 



M 

[ V N] 



U-1 JM 



Therefore, we have 

(12) AUN) > ^P-A 2 (N) > JLa 2 (N) = e — - — A 2 (N) 

where 
A 2 (N) := 

E me Z- ^ke[\r,N\] Xs,e( m ) ■ Xs,e( m + £ l k Q) ' Xs,e{ m + ^Q) " ~Xs,e( m + <Mx)- 
IM 

We let 
(13) 

M{N) := / E me z s Efce[hivj]XsM-X8(^ + 4Qfe)-X s (w + ^2Qfc)-X s (^ + 4QA;)di/(x). 
JM 

Since for every n E we have |x s (n)| < lj an d since |% Sje (ri) | = \Xs(n) + Xe( n )\ < 1 by 
Property ([I]) of Theorem ll.6| we deduce that 



(14) \A 2 (N) - A 3 (N)\ < 4 / E m&fi \xe(m)\ dv{ X ) < is 

J M 

where the last estimate follows by Part (jrv|) of Theorem 11.61 
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Next, we study the term A^(N). We utilize Property (JTTJ) of Theorem 11.61 namely 

R 



\Xs(n + Q) -Xs{n)\ < 



N 



for every reeZ^. 



We get for m G Z^, 1 < k < r/N, and for i = 1, 2, 3, that 

\ X s(m + iiQk) - Xs (m)\ < £ f fc i < £r]N i < ^ 

N N W 

where the last estimate follows from (jlip and the estimate iV > £/V. Using this estimate 
in conjunction with the definition (fl~3|) of A^(N), we get 



E, 



3s 



We denote by 1 the multiplicative function that is identically equal to 1. We claim that 
^({1}) > 5 2 . Indeed, if (^^O^eN is a multiplicative F0lner sequence in N we have 



lim 



and this is greater or equal than 



dv{x) 



lim 



lim 



nG*jv 



proving our claim. Using this we deduce that 

I E me% | Xs (m)| 4 cMx) > K{l})-E mez ^|l s (m)| 4 > 5 2 |E meZj ^ l s (m)|' 
J M 

Since l s = 1^ * ip for some kernel ^ on we have 

N 1 

E meZ _l s (m) = E meZ -E feeZ -l iV (A;)'0(m — k) = E keZf .l N (k) = -~ > — 
Combining the above we get 
(15) A 3 (N) > 



3e 



20 4 1 Q ' 

Putting CEO]), CL2]), (HU), and JT5]) together, we get 

1 / 5 2 _ \ 4c 2 



A(N) > e 



40£QR V20 4 



7e 



F(Q,R,ey. 



Recall that e = C3<5 2 , for some positive constant C3 that we left unspecified until now. 
We choose C3 < 1, depending only on £, so that 

for some positive constant C5 that depends only on t. Then we have 



A(N) > <5 2 -g£ 



4c 2 



QR F(Q,R,e) 



Recall that 



n 2 p 2 
F(Q,R,e) = 

where C4 was not determined until this point. We choose 

8c 2 c 3 



c 4 := 



C5 
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and upon recalling that e = c^5 2 we get 

Recall that Q and R are bounded by a constant that depends only on 6. Hence, A(N) is 
greater than a positive constant that depends only on 5, and in particular is independent 
of N, provided that N is sufficiently large, depending only on 5, as indicated above. This 
completes the proof of Theorem 12.51 □ 

3. Fourier analysis of multiplicative functions 

In this section we study the Fourier coefficients of multiplicative functions. Our goal 
is to establish a decomposition 

Xn = XN,s + XN,u 

similar to the one given in Theorem I6.1| but with the ?7 2 -norm in place of the £/ 3 - 
norm. We will then use this result in Section [H] as our starting point in the proof of the 
decomposition result for the C/ 3 -norm. 

Convention. In this section the functions xn are defined on Z^. In particular, all 
convolution products are defined on T,^ and the Fourier coefficients of \N ar e given by 

Xn(0 := E neZ - X N(n) e(-n£/N) for £ G Z#. 

Theorem 3.1 (Weak uniform decomposition for the f/ 2 -norm). For every 9 > there 
exist positive integers Q := Q(9) and R := R{9), and for every sufficiently large N , 
depending only on 9, there exists a kernel 4>n,0 on with the following properties: 
For every x £ jM, writing 

XN,s = XN * <t>N,8 and XN,u = XN ~ XN,s, 

we have 

R ~ 

(i) \XN,s( n + Q) — XN,s( n )\ < — for every n£Z^, where n + Q is taken mod N ; 

(ii) ll?CJV,«||r72(z-) < 9. 

Moreover, for all 9 and 9' , and every N such that 4>n,0 an d <pN,9' are defined, we have 

(16) if0<9'<9, then > > for every (G%. 

The monotonicity property (|16p plays a central role in the derivation of Theorem 11.61 
from Theorem 16.11 in Section 16.91 This is one of the reasons why we construct the kernels 
4>N,e explicitly in Section T3.3I 

The values of Q and R given by Theorem 13.11 will be used later in Section [UJ and 
they do not coincide (in fact, they are much smaller) with the values of Q and R in 
Theorems 16.11 and 11.61 

3.1. Katai's orthogonality criterion. We start with the key number theoretic input 
that we need in this section and which will also be used later in Section [5j 

Lemma 3.2 (Orthogonality criterion |20]). For every e > and every Kq G N there 
exists 5 := 8(Ko,e) > and K := K(Kq,e) > Kq such that the following holds: If 
N > K and f : [N] — > C is a function with \ f\ < 1, and 

max |E n6 N 7V / p ,j ] /(pn)7(p / n)| < S, 

p,p primes 
K <p<p'<K 

then 

sup \^ne[N]X(n)f(n)\ < e. 
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The dependence of 5 and K on Kq and e can be made explicit (for good bounds see 
[1]) but we do not need such extra information here. 

Lemma [3.21 is not strictly speaking contained in the paper |20| that shows only asymp- 
totic results. Moreover, Katai considers only the case of a function of modulus 1, written 
as e(t(n)), but the estimates are valid without any change for functions of modulus at 
most 1. For completeness we give the derivation. 



Proof. Let Kq G N and e > be given. In |20j . the letter / is used to denote a multi- 
plicative function that we denote here by \. Let K > Kq be a positive integer that will 
be made explicit below and will depend only on Kq, e. We write V for the set of primes 
p with Kq < p < K and let 

A V 

per L 

Let TV G N, x e M, and / : [N] -> C be a function with |/| < 1. We let 

S(N) := X(n)f(n). 

ne[N] 



After correcting some typos in |20|n inequality (3.8) of |20j reads as follows 
(17) 'y ^c + c'.^ + c" £ 1| £ f^Wn) 

p^p'&V n<min(N/p,N/p') 

for some positive universal constants c,c',c". For Kq given, we choose K sufficiently 
large so that cA^ 2 + d 'A^, 1 < e 2 /2, and then choose 5 := e 2 A 2 ?/{2c"\V\ 2 ). Thus defined, 
note that K and 5 depend on Kq and e only. Assuming that the function / satisfies the 
hypothesis of the lemma, and inserting the previous bounds in (|17p we get the desired 
estimate. □ 

3.2. An application to Fourier coefficients of multiplicative functions. Next, 
we use the orthogonality criterion of Katai to prove that the Fourier coefficients of the 
restriction of a multiplicative function on an interval [N] are small unless the frequency 
is close to a rational with small denominator. Furthermore, the implicit constants do not 
depend on the multiplicative function or the integer N. 

Corollary 3.3 (U 2 non-uniformity). For every 9 > there exist positive integers Q := 
Q(9) and V := V{&) such that, for every sufficiently large N , depending only on 9, for 
every x £ jM, and every £ G we have the following implication 



(18) if IxaKOI ^ then 



N 



N ' 



Proof. Let 5 := 5(1,9) and K := K(l,9) be defined by Lemma and let Q = K\. 
Suppose that N > K. Let p and p' be primes with p < p' < K and let £ G Z^. Since Q 
is a multiple of p' — p we have 

WQHH < JL-.\\(j/-p)$/N\\ < Q \\{p'-p)C/Nl 

Since iV < 20 £N, we deduce 

|E n6[LWJ] e(M/N)e(-pnZN)\ < < 2KQ „ < ' /v "^ 



NW-p)Z/N\\ N\\QZ/N\\ N\\QZ/N\\ 



'The sign + and the universal constants are missing on the right hand side. 
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Let V = 40eK/5. If ||QC/^II > QV/N, then the rightmost term of the last inequality is 
smaller than S, and thus, by Lemma 13.21 we have 



| E ne[jv]Xiv(n)e(-ne/iV)| = ^|E ne[JV]X (n)e(-</iV)| < 



contradicting (|18p . Hence, ||Q£/iV|| < QV/N, completing the proof. 



□ 



3.3. Some kernels. Next, we make some explicit choices for the constants Q and V of 
Corollary 13.31 This will enable us to compare the Fourier transforms of the kernels (J)n,6 
defined below for different values of 9 and to establish the monotonicity property (|16p . 
For every 9 > we define 

A(N,0):={teZz:3 X £M such that \Si(0\ > 2 } ; 



WW, g, 0) := max N 



N 



(19) 
(20) 



Q{6) := mini A;! : limsup W(AT, fe!, (9) < +oo) ; 
ken I tv^+oo J 



V{9) := 1 + 



— - limsup W(N,Q(0),6) 



It follows from Corollary 13.31 that the set of integers used in the definition of Q{9) is 
non-empty, hence Q(9) is well defined. The essence of the preceding definitions is that 
for every real number V > V(9) — 1, the implication (|18p is valid with 9 2 substituted for 
9, V for V, Q{9) for Q, and for every sufficiently large N. Furthermore, it follows from 
these definitions that for < 9' < 9, we have Q{9') > Q(9), and thus 

(21) for < 9' < 9, the integer Q{9') is a multiple of Q(9). 
Moreover, it can be checked that 

(22) V(9) increases as 9 decreases. 

Next, we use the constants just defined to build the kernels <f>N,e of Theorem 13.11 We 
recall that a kernel on Zr; is a non-negative function <j) on Z»j with E ng ^ - 4 > ( n ) = 1. We 



define the spectrum of a function <fi to be the set 

Spec(0) := {£€Z#:£(0^0}. 
For every m > 1 and A?" > 2m the "Fejer kernel" /jv,m on ^jy * s defined by 

fN,m(x) -- 



M).(,£). 

m ' TV 



— m<£<m 

The spectrum of /jv,m is the subset (— m, m) of Z^. Let Qn{&)* be the inverse of <2(#) 
in Z^, that is, the unique integer in {1, . . . ,N — 1} such that Q(9)Qn(9)* = 1 mod iV. 
For JV > 2Q(9)V(9)\9- 2 ] we define 

(23) 07v,e(x) = f N ,Q(e)v(e)\8-^(QN(0)*x). 

An equivalent formulation is that fN,Q(8)V(8)\8- 4 ]( x ) = 4>N,e{Q{Q)x). The spectrum of 
the kernel 4>n 8 is the set 



(24) 

and we have 
(25) 



iV 



< 



Q(6)V(0)\0- 



if £ G 2at )( 
otherwise. 
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We remark that the cardinality of 3^,e depends only on 8. 

3.4. Proof of Theorem 13.11 First, we claim that property (fl~6|) of Theorem 13.11 holds. 
Indeed, suppose that 8 > 8' > and that N is sufficiently large so that (pN,8' and <pN,e 

are defined. We have to show that (f>N,0'(£) > <^iV,e(£) f° r every £. Using (|2ip and (|22R 
we get that z>n,0' contains the set Sjv,e- Thus, we can assume that £ belongs to the latter 
set as the estimate is obvious otherwise. In this case, the claim follows from (|2ip . (|22~|) . 
and the formula ([25]) giving the Fourier coefficients of <pN,e- 

Next, we show the remaining assertions @j and (JTTJ) of Theorem 13.11 for the decompo- 
sition 

XN,s ■= 4>N,e * XN, XN,u '■= XN ~ 4>N,e * XN- 
Let 8 > 0, assume that N is sufficiently large depending only on 8, and let Q = Q(6), 
V = V{8), <f> Nfi , E = E Nfi) be defined by (JE), (J2DD, d23j) , (JMD respectively. 

For every x S At, if |xiv"(£)| > # 2 > then by the definition of Q we have ||Q£/iV|| < 
QV/N and thus <fo^ e (£) > 1-8* by (J25|). It follows that |^(C)-0jvJ*Xj\t(OI < # 4 < # 2 - 
This last bound is clearly also true when IxaKOI < 8 2 and thus using identity ([3]) we get 

||XJV - 4>N,e * XnWu*^) = E IXzvCO ~ ^,0 * Xw(0| 4 < 



where the last estimate follows from Parseval's identity. Hence, \\xn — ( Pn,9*Xn\\u 2 (Z~) — 
8, proving Property (JTTJ) . 

Lastly, for x G A4 and n£Z^, using the Fourier inversion formula and the estimate 
\e(x) — 1| < we get 



I (for,* *Xiv)(^ + Q) - (<?W,0 *xjv)(«)| < E ■ 



A r 



where the last estimate follows from (|24[) . As remarked above, |Hjv,e| depends only on 8, 
thus the last term in this inequality is bounded by R/N for some constant R that depends 
only on 8. This establishes Property (P of Theorem 13.11 and finishes the proof. □ 

4. MODIFICATIONS OF THE INVERSE AND FACTORIZATION THEOREMS 

In this section, we state and prove some consequences of an inverse theorem of Green 
and Tao [15J and a factorization theorem by the same authors [17J that are particu- 
larly tailored to the applications being pursued in subsequent sections. These results 
combined, prove that a function that has [7 3 -norm bounded away from zero, either has 
C/ 2 -norm bounded away from zero, or else correlates in a sub-progression with a totally 
equidistributed 2-step polynomial nilsequence of a very special form. 

Essentially all definitions and results of this section extend without important changes 
to arbitrary nilmanifolds. We restrict to the case of 2-step nilmanifolds as these are the 
only ones needed in this article and the notation is somewhat lighter in this case. 

4.1. Nilmanifolds. We introduce some notions from |17j . We record here only the 
properties that we need in this section and defer supplementary material to the next 
section and to Appendix IA1 

Let X = G /T be & 2-step nilmanifold. Throughout, we assume that G is a connected 
and simply connected 2-step nilpotent Lie group and T is a discrete cocompact subgroup 
of G. We view elements of G/T as "points" on the nilmanifold X rather than equivalence 
classes, and denote them by x, y, etc. The nilmanifold X is endowed with a base point 
ex which is the projection on X of the unit element of G. The action of G on X is 
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denoted by (g, x) i— > g ■ x. The Haar measure mx of X is the unique probability measure 
on X that is invariant under this action. 

We denote by m the dimension of G and by r the dimension of G% := [G,G], the 
commutator subgroup of G. We implicitly assume that G is endowed with a Mal'cev 
basis X: X is a basis (£1, . . . , £ TO ) of the Lie algebra g that has the following properties: 

(i) The map 

V>: ,t m ) h-> exp(ii£i) • . . . • exp(t m £ m ) 

is a homeomorphism from M m onto G; 

(ii) G 2 = ^({0} m - r xR r ); 

(hi) r = i)(z m ). 

Let 5 be endowed with the Euclidean structure making X an orthonormal basis. This 

induces a Riemannian structure on G that is invariant under right translations. The 

group G is endowed with the associated geodesic distance, which we denote by do- This 
distance is invariant under right translations. 

Let the space X = G/T be endowed with the quotient metric dx- Writing p: G — >■ X 
for the quotient map, the metric dx is defined by 

dx(x,y) = inf {d G (g,h): p(g) = x, p(h) = y). 

g,h£G 

Since T is discrete it follows that the infimum is attained. More precisely, there exists a 
compact subset Fq of G, such that 

(26) for every x,x' £ X there exist h, h £ Fq with dc(h, h') = dx{x, x'). 

We frequently use the fact that if / is a smooth function on X, then ||/||Li P (Y) — ll/llc 1 (A)- 
We also use the following simple fact: 

Lemma 4.1. For every bounded subset F of G there exists a constant H > such that 

(i) dx(g • x, g ■ x') < Hdx(x, x') for all x,x' £ X and every g £ F; 

(ii) for every f £ C m (X) and every g £ F, writing f g {x) := f(g ■ x), we have 
\\fg\\c m {X) < -^ll/llc m (X)- 

Proof. Let Fq be as in (|26p . Since the multiplication G x G — > G is smooth, its restriction 
to any compact set is Lipschitz and thus there exists a constant H > with dc(gh, gh') < 
Hdc(h, h') for every g £ F and h, h! £ Fq. The first statement now follows immediately 
from ([26|) . Since the map (g, x) \— > g ■ x, from K x X to X, is smooth, the second 
statement follows as well. □ 

Definition (Vertical torus). We keep the same notation as above. The vertical torus is 
the sub-nilmanifold G2/(G2 H T). The Mal'cev basis induces an isometric identification 
between G2 and W, and thus of the vertical torus endowed with the quotient metric, 
with T r endowed with its usual metric. Every k £ W induces a character u 1— » k • u of 
the vertical torus. A function F on X is a nilcharacter with frequency k if F(u ■ x) = 
e(k • u)F(x) for every u £ T r = G2HG2T) and every x £ X. The nilcharacter is 
non-trivial if its frequency is non-zero. 

Definition (Maximal torus and horizontal characters). Let X = G/T be a 2-step nil- 
manifold, let m and r be as above, and let s := m — r. The Mal'cev basis induces an 
isometric identification between the maximal torus G/(G2~T), endowed with the quotient 
metric, and T s , endowed with its usual metric. A horizontal character is a continuous 
group homomorphism ij: G — > T with a trivial restriction on T. A horizontal character 77 
factors through the maximal torus, and we typically abuse notation and think of 7/ as a 
character of the maximal torus, and identify r\ with an element k of Z s by the following 
rule a 1— > k • a := k\a\ + • • • + k s a s for a = (pci, . . . , a s ) £ T s and k = (ki, . . . , k s ) £ Z s . 
We define \\rj\\ := \k\\ + ■ ■ ■ + \k s \. 

Note that a trivial nilcharacter is any function that factors through the maximal torus. 
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4.2. A corollary of the inverse theorem. We start by stating the inverse theorem of 
Green and Tao for the £7 3 -norms. 

Theorem 4.2 (The inverse theorem for the t/ 3 (ZAr)-norm Theorem 12.8]). For 
every e > there exist 5 := 5(e) > and a 2- step nilmanifold X := X(e), such that for 
every sufficiently large N, depending only on e, and for every f : 7Ln — > C with \ f\ < 1 
and ||/||[/3(2 N ) > £ > there exists a function <£: X — > C w/ii/i || < 5||Lip(X) — 1 anc ^ an e/ement 
g £ G such that \E n& ^f (n)^(g n ■ ex)\ > 5. 

A sequence §(g n - ex) of this form where is only assumed to be continuous is defined 
as a 2-step basic nilsequence in [2]; if in addition we assume that <J> is Lipschitz, then we 
call it a nilsequence of bounded complexity a notion first used in [15j. 

We state a corollary of this result that is better suited for our purposes: 

Corollary 4.3 (Modified f/ 3 -inverse theorem). For every e > there exist 5 := 5(e) > 0, 
m := m(e), and a finite family % := rl(e) of 2-step nilmanifolds, of dimension at most 
m and having a vertical torus of dimension 1 (identified with T as explained above), such 
that: For every sufficiently large N, depending only on e, if f ': 7Ln — > C is a function 
with |/| < 1 and \\f\\u 3 (z N ) > e ; then at least one of the following conditions hold 

(0 \\f\\u^ N) 

(ii) there exist a nilmanifold X belonging to the family H, an element g G G, and a 
nilcharacter ^ of X with frequency 1, such that 

||*|| C 2-(x) < 1 and \E ne[N] f(n)y(g n ■ e x )\ > 5. 

Proof. Let e > 0. In this proof the constants 5, 5' , 5" , C, C%, . . . , depend only on e. 

Let X = G/T and 5 be given by Theorem 14.21 Let m be the dimension of X, and r 
be the dimension of G<i- The maximal torus G/(G 2 T) has dimension s := m — r. As 
mentioned above, we identify G2 with M. r and G% H T with Z r , thus the vertical torus 
G 2 /(G 2 n r) is identified with T r . 

Let /: Ztv — > C be a function with |/| < 1 and ||/||[/3 > e. Let <I> and g be given by 
Theorem I4.2[ i.e. 

(27) \E ne[N] f(n)$(r-ex)\>5 

and || < ^|lLi p (x) — 1- We can assume that ||$Hc2m(x) — I n( ieed, there exists a function 
<£' with ||<1? — ^'Hoo < 5/2 and || ( I ) '|| C 2m(x) < G f° r some constant C depending only 

on 5 and X and thus only on e. Up to a change in the constant 5, the conclusion of 
Theorem 14.21 remains valid with <!>' substituted for <3?. 



We need some preliminary definitions. For k G Z r , the character k of G 2 /(G 2 D T) 
induces a character of G2 given by some linear map <^>k : G2 = W — > M.. Let Gk be the 
quotient of G by the subgroup ker(0k) of G2 and be the image of T in this quotient. 
Then is a discrete and co-compact subgroup of Gk and we let X^ : = Gk/Tk- We 
write tt^: X ^ X^ for the natural projection and let ex k := ^k(ex)- 

If k is non-zero, then X^ is a non-Abelian 2-step nilmanifold. The commutator Gk,2 of 
Gk is the quotient of G2 with the kernel of c/>k and thus has dimension 1 and the vertical 
torus Gk/^Gk^Tk) of has dimension 1. If k is the trivial character, then X^ is the 
maximal torus G/(G2T) of X and thus is a compact Abelian Lie group. 

We recall the definition of the vertical Fourier transform. The restriction to G2 H T of 
the action by translation of G on X is trivial, and thus this action induces an action of 
the vertical torus on X by (u, x) 1— > u • x for u G T r and x G X. The vertical Fourier 
series of the function <3? is 

$ = ^2 where $]<(x) = / <I>(u • x)e(— k • u) dmjr (u). 
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For every k £ Z r , the function <3?k is a nilcharacter with frequency k and thus can be 
written as 

$k = I'k o vr k 

for some function ^k on X^. If k 7^ 0, then <I>k is a nilcharacter of X^ with frequency 
equal to 1. Moreover, for every k £ Z r , since H^H^r^x) < 1 5 then [|$k|lc2»»(jn — 1 an d 
l^kC^)! < Ci(l + ||k||) _2m for every k £ Z r and every x £ X. Since m > r, there exists 
a constant C2, depending only on X and (5, and thus only on e, with 

|$k(a?)| < V 2 for every i£l 

k; |]k]|>C 2 

Replacing $ with its vertical Fourier series in (|27p . this last bound implies that there 
exists 5', depending only on X and S, and thus only on e, such that 

(28) |E„ 6 [JV]/(n)*k(flE • e~)\ = \E ne[N] f(n)^(K ■ e~)\ > 6' 

for some k £ Z r with ||k|| < C2, where is the image of g in Gk under the natural 

projection. Since <J>k = ^k ^k and ||$k|lcam(x) — L we nave that 

ll^fc|lc 2m (X k ) — C3 f° r every k € Z r with ||k|| < C%. 
We define the family T~L of 2-step nilmanifolds as follows 

H ■= {X k : k/0, ||k|| < C 2 }. 

It remains to show that either Property (P or Property (|n|) is satisfied. 

Suppose first that the element k in (j28|) is non-zero. We have ||C,T 'Q/klle^fXk) — 1 
and \^n€[N]f ( n )C^ 1 ^f'k.(g^ex k )\ > C^ 1 5', showing that Property (jn]) is satisfied. 

Otherwise, (|28p holds for k = 0, in which case Xk is the maximal torus Gj (G^T) = T s 
and ^0 is a function on T s with || v I , o||c 2m (T s ) — C*3- Let a be the projection of g in T s . 
For some constant C4, we have 

2 l^oWl < C 4 ; 
5' < \E ne[N] f(n)* (na)\ < £ |^)P n6[J v]/(n) e(n^ • ct)|. 

Thus, there exists £ £ Z s with |E ne[7V] /(ra) e(n£ ■ a)\ > 5'/C A . Since the l 4 / 3 (Z N )- 
norm of the Fourier coefficients of any linear phase is bounded by a universal constant, 
arguing as in the proof of Lemma [2.61 we deduce that ||/||{/ 2 (Zjv) — f° r some constant 
5" := 5"(e), showing that Property is satisfied. This completes the proof. □ 

4.3. A modification of the factorization theorem. We plan to use a decomposition 
result for polynomial sequences on nilmanifolds from |17j . We keep the notation and the 
conventions of Section [4.11 and introduce some additional ones next. 

Definition (Total equidistribution). Let X be a nilmanifold. We say that the finite 
sequence g: [N] — > G is totally e-equidistributed on X if 

|E ne [Ar]lp(ri)-F(5(n) • ex) I < £, 

for all F £ Lip(X) with ||i ? ||Lip(x) — 1 an d / F dmx = 0, and all arithmetic progressions 
Pm[N}. 

Modulo a change in the constants, our definition of total equidistribution is equivalent 
to the one given in |17) . where the claimed estimate is \K n< ^pF{g{n) ■ ex)\ < £ for all 
F £ Lip(X) with ||i ? ||Lip(x) — 1 an d / F dmx = 0, and arithmetic progressions P in [N] 
with \P\> eN. 
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Notation. If (g(n)) is a sequence in G and h G Z, we denote by (dh(g)) the sequence 
denned by dhg(n) := g(n + h)g{n)~ l for re G N. 

Definition (Polynomial sequences). A polynomial sequence in a nilpotent group G is a 
sequence g: N — >• G that has the form g(n) = a^ 1 ^ ■ ■ ■ ct? , where ai, . . . ,a& 6 G and 
Pi,... ,pjfe G Z[i]. 

An equivalent definition |17l Lemma 6.7] is that there exists an integer d such that 
dh d+1 dh d • • • 9hi9{ n ) = 1g f° r all n G N and all /ti, . . . , /i^+i G Z. The smallest integer d 
with this property is the degree of the sequence. 

We say that a sequence (g(n)) in G is a degree 2 polynomial sequence with coefficients 
in the natural filtration if it can be written as 

g(n) = 909192 where g ,g 1 G G and g 2 G G 2 . 

Definition (Rational elements). We say that an element g G G is Q-rational for some 
Q G N if there exists m < Q with g m G T. We say that g is rational if it is Q-rational 
for some Q G N. 

Rational elements form a countable subgroup of G \17\ Lemma A. 12]. 

Definition (Smooth and rational sequences). Given a nilmanifold G/T and M,N G N 
with M < N, we say that 

• the sequence e : [N] — > G is (M, N)-smooth if for every n G [A] we have 
d G (e(n), 1 G ) < M and d G (e(n),e(n - 1)) < Af/JV; 

• the sequence 7: [iV] — >• G is M-rational if for every n G [iV], 7(n) is M-rational. 

In |17| the next result is stated only for functions of the form oj(M) = M~ A , but 
the same proof works for arbitrary functions u) : N — > M + . For the notion of a rational 
subgroup that is used in the next statement we refer the reader to Appendix |A"1 

Theorem 4.4 (Factorization of polynomial sequences [17, Theorem 1.19]). Suppose that 
X := G/T is a 2-step nilmanifold. For every M G N there exists a finite collection J-{M) 
of sub-nilmanifolds of X, each of the form X' := G'/V, where G' is a rational subgroup 
of G and V := G' n T, such that the following holds: 

For every function uj: N — > M + and every AIq G N, there exists M\ := Mi(Mq,X,u}), 
such that for every N G N, and for every degree 2 polynomial sequence (fl , (^)) n e[A r ] ^ n ^ 
with coefficients in the natural filtration of G, there exist M G N with Mq < M < M±, a 
nilmanifold X' belonging to the family J-{M), and a decomposition 

g {n) = e(n)g'(n)j(ri), n G [N], 

where e, 5', 7: [N] — > G are degree 2 polynomial sequences with coefficients in the natural 
filtration of G, that satisfy 

(i) e is (M,N) -smooth; 

(ii) (g'(n)) n £[N] takes values in G' , and the finite sequence (g'(n)-ex') n e[N] * s totally 
co(M)-equidistributed in X' with the metric dx' ; 

(iii) 7: [N] —7- G is M-rational, and (7(w)) n e[iV] * s periodic with period at most M. 

Remark. We emphasize that in the previous result and subsequent corollary, the number 
M\ is independent of and the family {T{M))m^m is independent of oj and N . The 
choice of the sub-nilmanifold X' may depend on N, but it stabilizes for large enough N. 

We will use the following corollary of the previous result that gives a more precise 
factorization for a certain explicit class of polynomial sequences. 

Corollary 4.5 (Modified factorization). Let X := G/T be a 2-step nilmanifold with 
vertical torus of dimension 1. For every M G N there exists a finite collection J~{M) of 
sub-nilmanifolds of X, each of the form X' := G' /T' , where T' := G' (~)T and either 
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(i) G' is an Abelian rational subgroup of G; or 

(ii) G 1 is a non-Abelian rational subgroup of G and G' 2 /(G 2 flF') has dimension 1, 

such that the following holds: 

For every function uj: N — > R + and every Mq G N ; there exists Mi := M\(Mq, X,u) , 
such that for every N G N and every g G G, there exist M G N with Mq < M < Mi, a 
nilmanifold X' belonging to the family J-(M), and a decomposition 

g n = e(n)g'(n)j(n), n G [N], 

where e,g',j: [N] — > G are polynomial sequences that satisfy 

(iii) e is (M, N) -smooth; 

(iv) (s ,/ (^))ne[Af] takes values in G', has the form 

(29) g (n) = g'^g'^g^ where g^g'\,g 2 ^ Gr , and moreover g 2 G G' 2 in case ([n]), 

and (g'(n) ■ ex') n e[N] * s totally u(M)-equidistributed in X' with the metric dx* ; 

(v) 7: [N] —> G is M -rational, and (7(w)) n e[iV] ^ s periodic with period at most M. 

Proof. Let the integers M\,M and the nilmanifold X' = G'/V belonging to the family 
J-{M) be given by Theorem l4.4l Note that the sequence (g n ) n ^\N] i s a degree 2 polynomial 
sequence in G with coefficients in the natural filtration. Let (<? / (w))ng[A''] be the sequence 
given by the decomposition of Theorem 14.41 This sequence is a polynomial sequence in 
G with coefficients in the natural filtration of G and thus it can be written as g'(n) = 

g^g'xg^ f° r some g'Q,g[ G G and some g 2 G G2. It remains to show that this sequence 
has the form ()29p . i.e. that g^g'i^g^ G G' and furthermore that g' 2 G G' 2 in case (ii). 

Since g' Q = g'(0) we have g' G G'. Recall that d±g(n) = g(n + \)g(n)~ l . Using the fact 
that G2 is included in the center of G we obtain 

dig'{n) = g'M^g? and dig 1 {n) = g' 2 . 

It follows that g[ and g 2 G G' . 

If we are in case ([1]), then G' is Abelian and we are done. Suppose now that we are in 
case (ii) where G' is non-Abelian. Then G' 2 is a non-trivial subgroup of G2. Moreover, 
G 2 is closed and connected, and by hypothesis G2 is isomorphic to the torus T. It follows 
that G' 2 = G2. Hence, g 2 G G 2 . This shows that the sequence (fl ,/ (^)) n G[A r ] has the 
required properties and completes the proof. □ 



5. CORRELATION OF MULTIPLICATIVE FUNCTIONS WITH NILSEQUENCES 

The main goal of this section is to establish some correlation estimates needed in the 
proof of the decomposition results given in the next section. We show that multiplica- 
tive functions do not correlate with a class of totally equidistributed 2-step polynomial 
nilsequences. The precise statements appear in Propositions 15.31 and 15.41 

5.1. Quantitative equidistribution. We start with a quantitative equidistribution re- 
sult for polynomial sequences on nilmanifolds by Green and Tao |17| that strengthens an 
earlier qualitative equidistribution result of Leibman [22J. 

Definition. If g: [N] — > T is a finite polynomial sequence in T, of the form 

g(n) = «o + ct\ n + a 2( )+ --- + CKd(,) where d G N and qq, . . . G T, 



2 J \d 
then the smoothness norm of g is defined by 



\g\\c°°{N] ■= ^f£ d NJ W a 3 
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Theorem 5.1 (Quantitative Leibman Theorem [17J). Let X := G/T be a 2-step nil- 
manifold. Then for every small enough e > there exists D := D(X, e) > with the 
following property: For every N G N, if g: [N] —> G is a polynomial sequence of degree 
at most 2 such that the sequence {gin) -ex)n£{N] * s n °t £- equidistributed in X, then there 
exists a horizontal character rj := f](X, e) such that 

< \\r]\\ < D and \\rj o <7||o°°[iV] < D, 

where we abuse notation and think of rj as a character of the maximal torus and g(n) as 
a polynomial sequence on the horizontal torus. 

We plan to use a partial converse of this result. 

Lemma 5.2 (A partial converse to Theorem 15. ip . Let X := G/T be a 2-step nilmanifold. 
There exists C := C(X) > with the following property: If N is sufficiently large, 
depending only on X, and (g(ra)) ng nvi is a degree 2 polynomial sequence in G such that 
there exists a non-trivial horizontal character rj of X with \\r]\\ < D and H^offHc 00 ^] — D, 
then the sequence (g(n) ■ ex) n &[N] * s n °t totally C D~ 2 -equidistributed in X. 

Proof. Since [|r/ o ^Hcoonyi < D we have 

rj(g(n)) = ct^f . J for some aj G T with ||qj-|| < — - for j = 1,2. 

0<J<2 ^ ' 

It follows that 

|l — 6(77(5(77,))) I < — for l<n<ci — 
where c\ is a universal constant. Thus, 

1 

> -. 

~ 2 

Furthermore, since ||?7|| < D, the function x 1— > e(r](x)), defined on X, is Lipschitz with 
constant at most C'D for some C 1 := C'(X), and has integral since r] is a non-trivial 
horizontal character. Therefore, the sequence (g(n) ■ ex) n e[N] 1S n °t totally (CD~ 2 )- 
equidistributed in X where C = C\/C' . □ 

We can now prove the two main results of this section that give asymptotic orthogonal- 
ity of multiplicative functions to some totally equidistributed nilsequences. These results 
will be used later in the proof of Theorem 16. II to treat each of the two distinct cases aris- 
ing from an application of Corollary 14.51 Both proofs are based on Katai's orthogonality 
criterion (Lemma I3.2p and the quantitative Leibman Theorem (Theorem 15. ip . 

Proposition 5.3 (Discorrelation estimate I). Let X := G/T be a 2-step nilmanifold and 
t > 0. There exists a := o~(X,t) > with the following property: For every sufficiently 
large N, depending only on X andr, if (g{n)) n ^m] is a degree 2 polynomial sequence in 
G with coefficients in the natural filtration that is totally a -equidistributed in X, then 

sup \E ne[N] lp(n)x(n)<&((g(m + n) ■ e x ))\ < t 

where the sup is taken over all integers m, multiplicative functions x ^ M., functions 
$ G Lip(X) with || < I ) ||Lip(X) — 1 an d J & dmx = 0, and arithmetic progressions P C [N]. 

Proof. Let r > and suppose that 

(30) \K ne[N] l P (n)x(nMg(m + n) ■ e x )\ > r 

for some m G Z, x £ -M, <5 G Lip(X) with || < I ) ||Lip(x) ^ 1 an d / & dmx = 0, and 
arithmetic progression P C [N]. 

We endow the nilmanifold X x X with the cartesian product structure and we let D be 
associated to r (r plays the role of e) and the nilmanifold X x X as in Theorem 15.11 We 



\^n<l cl N/D\e{v(g{n)))\ 
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also let K and 5 be denned as in Lemma [3, 21 with D substituted for Kq and r substituted 
for e. Applying this lemma to the sequence (lp(n)<J>(g(m+n))) ng n\n, estimate (|30p gives 
that there exist primes p, p' with D < p < p' < K such that 

| E ne[|AT/ P 'J] lp(pn)$(g(m + pn) ■ e x ) lp{p'n)<&(g(m + p'n) ■ e x )\ > 5- 

Since lp(pn)lp(p'n) = lp 1 (n) for some arithmetic progression Pi C [L-^/p'J] we have 

(31) |E ne [ LA r /p /j]l Pl (n) ^(^(m +pn) ■ e x )${g(m + p'n) ■ e x )\ > 8. 

On the other hand, since J <I> dm x = 0, we have J* $ $ dm XxX = 0. Furthermore, 
||<I> (g) < &||Lip(A"xX) ^ 1; an d thus (13111 gives that the sequence 

(g{m + pn) ■ e x ,g(m + p'n) • ex)„ e [Liv/p'J] 

is not totally <5-equidistributed in X x X. By Theorem 15.11 there exists a non-trivial 
horizontal character n of X x X with 

(32) \\n\\ < D, and 

(33) \\n(g(m + pn) ■ e x ,g(m + p'n) ■ e x )\\ coo[lN/pli] < D. 

Let 7r is the natural projection on the maximal torus G/^G^V) which we identify with 

T s for some s G N. Recall that g{n) = g^g^g^ for some go,gi G G and g 2 £ G2. Let 
cto := tt((7o), cki := Tr(ffl)- Since ^(52) = 0) we have ir(g(n)) = olq + ncei, and since the 
horizontal character 77 has the form 

n(x, x ) = k ■ 7r(x) + k' • 7r(a/) 

for some k, k' G 7L S with < ||k|| + ||k'|| = \\n\\ < D, we get that 

(34) n(g(m + pn) ■ e x ,g(m + p'n) ■ ex) = n{pk + p'k') ■ cki + (k + k')(ao + ma.\). 
Putting together (|33p and (|34|) gives 



llfrk + pVJ.^IIS^S— . 

Let 6 be the horizontal character of X given by 9(x) = (pk + p'k') • ir(x). We have 
o g(n) = n(pk + p'k') • a.\ and thus \\6 o <7||c°°[jv] < 2DK. Since p and p' are primes 
and ||k||, ||k'||< D < p < p' , we have pk + p'k' 7^ and 0^0. Furthermore, \\9\\ = 
||pk + p'k' 1 1 < DK. It follows from Lemma 15,21 that there exists a depending only on 
D,K,X, and thus only on X and r, such that the sequence (g(n))n£[N] i s n °t totally 
cr-equidistributed in X. This completes the proof. □ 

Proposition 5.4 (Discorrelation estimate II). Let s G N and r > 0. There exists 
a := ct(s,t) > with the following property: For every sufficiently large N, depending 
only on s and t, if (g(ra))„ g jjv] * s a polynomial sequence in T s of the form 

g(n) = a + "in + a 2 ^ J > «i £ T s , i = 0,1,2, 

is totally a-equidistributed in T s , then 

sup |E ne[iV ]lp(n)x(ri,)$(g(m + n))| < r 

where the sup is taken over all integers m with \m\ < iV, multiplicative functions x G -A4, 
$ G Lip(X) m'i/t 1 1 1 1 Lip.(jsr) < 1 anc ^ / & dm x = 0, and arithmetic progressions P C [N]. 

Proof. In this proof C%, C2, ■ ■ ■ are constants that depend only on s and r. 
Without loss of generality, we can assume that ||^ > ||c 2s (A) — 1- Suppose that 

|E ne[ jv]lp(n)x(n)$(g(m + n))| >r 
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for some r > 0, integer m with \m\ < N , y G M, $ G Lip(X) with || < I ) ||Lip(x) — 1 and 
J & dmx = 0, and arithmetic progression P C [iV]. We have 

[jv]lp(n)x(n)e(k • g(m + n))| 

Cg 

- 2 Y~ |j02l! 1E nG[iV] 1 p( n )xHe(k-g(m + n))| 

for some constant Co := Cq(s). It follows that there exist constants C%, 9 := 0(s, r) > 0, 
and k G Z s , such that 

||k|| < Ci and |E ne [jv]lp(n)x(n)e(k • g(m + n))| > 0. 

Let <5 and iT be defined by Lemma 13.21 with 1 substituted for Kq and 6 substituted for 
e. Note that 5 and K depend on s and r only. There exist primes p,p' with p < p' < K 
such that 

lEneaAVp'JlMP^Mf 1 '™) 6 ^- (g("i + pn) -g(m + p'n)))| > 5. 

Writing f3\ = k • ai, /3 2 = k • «2, and lp(pn)lp(j/n) = lp^ra) where Pi C [[N/p'\] is 
an arithmetic progression, we can rewrite the previous estimate as 

\^ne[[N/ P >}]lPi(n) e(u(n))| > 6 

where 

n(n)= Q/32(p 2 -p' 2 ) + n(/3 2 (Q - ) + (m/3 2 + ft ) (p - p')) • 

Since [^/VJ > N/2K, the sequence (ii(n)) ne nv| is not totally 5/2i^-equidistributed in 
the circle. By the Abelian version of Theorem 15. 1| it follows that there exists an integer 
I with < I < D := D(5/2K) such that 



D 



P\ P 



(p 2 -p' 2 )\\ < ^ and (; )) + +ft)(p-j0) 



< — . 

~ N 



We deduce first that /3 2 is at a distance < C2/N 2 of a rational with denominator < C3, 
and then that /3i is at a distance < C4/N of a rational with denominator < C5 (here we 
used that \m\ < A r ). Hence, there exists a non-zero integer I', bounded by some constant 
C 6 , such that \\l%\\ < C 7 /N 2 and p'/3i|| < C 8 /iV. Taking k' = Z'k, we have 

< ||k'|| < dC 6 ; ||k'-a 2 ||<-^; ||k / -at 1 || < 

Using this and Lemma I5.2[ we deduce that the sequence (s(n)) n e[N] i s n °t totally a- 
equidistributed for some a > that depends only on s and r, completing the proof. □ 

We give one more application of Theorem 15. II that will be needed in the next section: 

Lemma 5.5 (Shifting the nilmanifold). Let X := G/Y be a 2-step nilmanifold, G' be a 
rational subgroup ofG,h€zGbea rational element, X' := G' -ex, ey := h-ex, andY := 
G' ■ ey. For every e > there exists 5 := 5{G' ,X,h,e) > with the following property: 
If (9'( n ))ne[N] * s a polynomial sequence in G' of degree at most 2, such that the sequence 
(g'(n) ■ ex)ne[N] * s totally 5-equidistributed in X' , then the sequence (g'(n) ■ eY) n e[N] * s 
totally e-equidistributed in Y . 

By Lemma IA.6I in the Appendix, T D G' is co-compact in G' and thus X' is a closed 
sub- nilmanifold of X. In a similar fashion, (hTh -1 ) D G' is co-compact in G' and Y is a 
closed sub-nilmanifold of X. 
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Proof. In this proof, C%, C2, . . . are constants that depend only on G', X, and h. 

By Lemma lA, 51 in the Appendix, the group T P hTh^ 1 P G' has finite index in the two 
groups r n G' and hTh^ 1 P G' . We write 

Z' := G'/G' 2 (T n G'), Zi := G'/G' 2 (T P /ir/i" 1 n G'), and Z 2 := G'/G'^/ir/j- 1 n G'). 

Then Z' is the horizontal torus of X' , the nilmanifold Y can be identified with G' / (hTh~ l C\ 
G'), and thus Z 2 is the horizontal torus of Y. Let p: Z\ — > Z' and q: Z\ — )■ Z 2 be the 
natural projections. These group homomorphisms are finite to one. 

Let e > 0, and suppose that the polynomial sequence (g'(n) ■ ey) ng [7v] nas degree 
at most 2 and is not totally e-equidistributed in Y. We denote by D the integer that 
Theorem 15.11 associates to e and Y. Then there exists a non-trivial horizontal character 
77 of Y, with 

(35) ^ |M| < D and ^(^(n))!!^^ < D. 

Recall that r\ factors to a character of the horizontal torus Z 2 of Y; we slightly abuse 
notation and denote it also by r/. We have that r/ o q is a character of Zi and since 
</: Z\ — > Z 2 is finite to one, ||?7 o q\\ < Ci\\r]\\ for some constant C\. 

Since T n hTh^ 1 n G' has finite index in T Pi G', there exists £ G N such that 7^ G 
rn/ir/i _1 nG' for every 7 G TnG'. Therefore, since the restriction of rjoq to TnhTh~ 1 nG' 
is trivial, for every 7 G T Pi G' we have £r/ o 5(7) = r) o q{rf) = 1. Hence, tr\o q has a 
trivial restriction to G^rn G')and so there exists a character £ of Z' with Ir/oq = (op. 
We have 7^ ||(|| < C2 1 1 -^77 q\\ < C3 1 1 ^7 1 1 < G3^ for some constants G2, G3. We consider 
£ as a horizontal character of X' = G'/ (G' Pi T) and thus as a character of G'. 

For every n G N we have ((g'(n)) = £rj(g'(n)) and thus, by hypothesis (|35p . ||£ o 
ff'llc 00 ^] — G4D for some constant G4. By Lemma 15.21 the sequence (g'(n) ■ ex) n e[N] 
is not totally G5-D -1 -equidistributed in X' for some constant G5. Letting 5 = C^D -1 
completes the proof. □ 

Lastly, we give some uniform discorrelation estimates that serve as a model for the 
more complicated estimates obtained in the sequel. The argument is based on Katai's 
criterion (Lemma 13. 2p and the Abelian version of Theorem 15.11 which is nothing more 
than a suitable use of Weyl's estimates. 

Proposition 5.6 (Model discorrelation estimates for XN.u)- Let e > 0. Then there exists 
9 := 9(e) such that, for every sufficiently large N, depending only on e, the following 
holds: If 

XN,s ■= XN * 4>NM, XN,u ■= XN ~ XN,s, 
where xn = X " 1[JV] an d <t>N,6 is the kernel defined by (|25p , then 

(36) sup |E f ^XAr,«(n)e(n 2 a)| < e. 

Proof (Sketch). Let e > and be sufficiently large depending only on e (how large 
will be determined below). 

Let 9 := 9(e) > be given by (140 p below and for this value of 9 let 4>n,8 be given by 
(|25p . Theorem 13.11 implies that for sufficiently large N, depending only on e, we have 



(37) \\xN,u( n )\\uHz^) <0(e). 

We claim that the asserted estimate (|36p holds. Arguing by contradiction, suppose that 

( 38 ) \ E ne[N]XN,u( n ) e ( n2a )\ > £ 

for some x £ M and a G R. We consider two cases depending on the total equidistribu- 

Kie[N]- 



tion properties of the sequence (n 2 a) 
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Minor arcs. We use Proposition 15.41 with s = 1 and e/3 in place of r. We get that 
there exists a := cr(e), such that for all sufficiently large N, depending only on e, if the 
sequence (n 2 a) nl -^ is totally cr-equidistributed, then 

ma ?~ l E ne[JV]X(»)e((m + n) 2 a)\ < -e. 

m£[-N,N] 1 

Using this and the fact that xn,u = Xn * (1 — 4») where <j) is a kernel on Zs, we deduce 
(see Section [6~8l for details) that for all sufficiently large N, depending only on e, we have 

\ E ne[N]^A n ) e ( n2a )\ < e 

which contradicts (|38p . 

Major arcs. Suppose now that the sequence (n 2 a) n( ^^ is not totally cr-equidistributed 
where a was defined in the minor arc step. Then, as is well known (and also follows by 
Lemma l5.2p . a has to be close to a rational with a small denominator, more precisely, 
there exist positive integers Q,R that depend only on a, and consequently only on e, 
and positive integers p,q < Q such that 

P 



a 



R 
< —. 

~ N 2 



We factor the sequence (n 2 a) n( -^ as follows 

n 2 a = e(n) + j(n), where e(n) := n 2 ( a — — ) , j(n) := n 2 — . 

V q) q 

Note that |e(n + 1) — e(n)| < 2R/N for n £ [N], Furthermore, the sequence 7(71) is 
periodic with period q. After partitioning the interval [N] into sub-progressions where 
e(n) is almost constant and "f(n) is constant, and using the pigeonhole principle, it is not 
hard to deduce from (|38p (see Section 16.51 for details) that 

( 39 ) KelN]^) ■ XN,u(n)\ > 

for some arithmetic progression P C [N] provided that iV is sufficiently large depending 
only on e. Using (|39p and Lemma [2.61 we deduce that 

1 e 2 

(40) \\XN, u {n)\\ip { z s) > =: 9(e) 

where c\ is a universal constant. This contradicts (|37p and completes the proof. □ 

In the next section we prove a strengthening of the previous result where the place of 
(e(n 2 a)) takes any two step nilsequence (<I>(a n • ex)) where $ is a function on a 2-step 
nilmanifold with Lipschitz norm at most 1. Our proof is much more complicated in this 
case but the basic strategy remains the same as in the previous argument. 

6. Higher order Fourier analysis of multiplicative functions 

The goal of this section is to prove the main decomposition result stated in Theo- 
rem The key ingredient that enters its proof is the following "weaker" decomposition. 

Theorem 6.1 (Weak uniform decomposition for the C/ 3 -norm). For every 9q > and 
e > 0, there exist a positive real 8 < 8q, and positive integers Q := Q(£,9q) and R := 
R(s,9o), with the following properties: For every sufficiently large N, depending only on 
9q and e, and for every x £ -M, the function \N admits the decomposition 

XN(n) = XN,s( n ) + XN,u( n ) f or ever V «^jv. 
where the functions \N s an d XN u satisfy: 
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(i) Xn,s = Xn * 4>N,e? where <pN,e * s the kernel on Z^ defined by (|23p and is inde- 
pendent of x, an d the convolution product is defined in Z^; 

R 

(ii) \xN,s(n + Q) — XN,s(n-)\ < -~ for every n G Z^, where n + Q is taken mod A 7 "; 

(iii) ||XJV,«||i73(z-) < e. 

The proof of Theorem 16.1 1 takes the largest part of this section. The main disadvantage 
of this result is that the bound on the uniform component is not strong enough for our 
applications. In Section [6l3l we combine Theorem 16.11 with an energy increment argument 
to prove Theorem 11.61 that gives very strong bounds on the uniform component. 

6.1. Some preliminary remarks and proof strategy. A substantial part of our proof 
is consumed in handling correlation estimates of arbitrary multiplicative functions with 
2-step nilsequences of bounded complexity. Our proof strategy follows the general ideas 
of an argument of Green and Tao from |14| I18j where uniformity properties of the Mobius 
function were studied. In our case, we are faced with a few important additional difficul- 
ties stemming from the fact that we are forced to work with all multiplicative functions 
some of which are not [7 2 -uniform (see the example in Section |1.5p . Furthermore, we 
have to establish estimates with implied constants independent of the elements of Ai. 
We give a brief summary of our strategy next. 

To compensate for the lack of [7 2 -uniformity of a multiplicative function \ we subtract 
from it a suitable "structured component" Xs given by Theorem 13.11 so that Xu = X ~ Xs 
has extremely small C/ 2 -norm. Our goal is then to show that \u has small [7 3 -norm. 
In view of the C/ 3 -inverse theorem of Green and Tao (Theorem |4.2p . this would follow 
if we show that Xu has very small correlation with all 2-step nilsequences of bounded 
complexity. This then becomes our new goal. 

The factorization theorem for nilsequences (Theorem I4.5|) practically allows us to treat 
correlation with major arc and minor arc 2-step nilsequences separately. Orthogonality 
to major arc (approximately periodic) nilsequences is easily implied by the ^-uniformity 
of Xu- So our efforts concentrate on the minor arc (totally equidistributed) nilsequences. 
Combining the orthogonality criterion of Katai (Lemma 13 . 2 1) with some quantitative 
equidistribution results on nilmanifolds (Theorem 15. ip . we deduce that the arbitrary 
multiplicative function x is asymptotically orthogonal to such sequences (Propositions 1531 
and 15. 4p . The function Xu is n °t multiplicative though, but this is easily taken care by 
the fact that Xu = X ~ Xs an d the fact that Xs can be recovered from x by taking a 
convolution product with a positive kernel. Using these properties it is an easy matter to 
transfer estimates from x to Xu- Combining the above, we get the needed orthogonality 
of Xu to all 2-step nilsequences of bounded complexity. Furthermore, a close inspection 
of the argument shows that all implied constants are independent of x- This suffices to 
complete the proof of the decomposition result. 

Although the previous sketch communicates the basic ideas behind the proof of Theo- 
rem UTTJ the various results needed to implement this plan come with a significant number 
of parameters that one has to juggle with, making the bookkeeping rather cumbersome. 
We use the next section to organize some of these data. 

6.2. Setting up the stage. In this subsection we define and organize some data that 
will be used in the proof of Theorem 16.11 We take some extra care to do this before the 
main body of the proof in order to make sure that there is no circularity in the admittedly 
complicated collection of choices involved. The reader is advised to skip this subsection 
on a first reading and refer back to it only when necessary . 

Essentially all objects defined below will depend on a positive number e and on the 
integer t defined in Section 11.51 We consider these parameters as fixed, and to ease 
notation we leave the dependence on e and t implicit most of the time. 
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Furthermore, most objects defined below also depend on a positive integer parameter 
M that we consider for the moment as a free variable. The explicit choice of M takes 
place in Section [6^1 and depends on various other choices that will be made subsequently; 
what is important though is that it is bounded above and below by positive constants 
that depend only on e. The parameter M plays a central role in our argument, and to 
avoid confusion we keep the dependence on M explicit most of the time. 

Most objects we define use the families of nilmanifolds J~(M) introduced in Corol- 
lary 14.51 We would like to stress that these families do not depend on the function 
u that occurs in this statement. This allows us to postpone the definition of u until 
Section WTQ 

6.2.1. Objects defined by the inverse theorem. Throughout the argument we let e be a 
fixed positive number. Corollary 14.31 defines the objects 

% := T-L(e), 5 := 5(e), m := m(e) 

where Ti is a finite family of nilmanifolds with vertical torus of dimension 1, 5 is a 
positive number, and m is a positive integer. In the sequel we implicitly assume that N 
is sufficiently large, depending only on e, so that the conclusion of Corollary 14.31 holds. 

6.2.2. Objects associated to every fixed nilmanifold in %. Let M G N be fixed and X : = 
G/T be a nilmanifold in %. We define below various objects that depend on M and X. 

By Corollary IA.3| for every M G N there exists a finite subset E := E(M, X) of G, 
consisting of M-rational elements, such that for every M-rational element g £ G there 
exists h G E with h~ 1 g E T, that is, g ■ ex = h ■ ex- We assume that 1q G E. 

Let J- := J-(M, X) be the family of sub-nilmanifolds of X defined by Corollary 14.51 
We define a larger family of nilmanifolds T 1 := J-'(M,X) that have the form 

Y := G' ■ ey = G' /(hTh^ 1 n G') 

where 

X' := G'/V G J 7 , h G E, and e Y = h ■ e X - 

By Lemma 14. 1| there exists a positive constant H := H(M, X) with the following 
properties: 

(i) for every h G E, every g G G with da(g, 1g) — M, and every x,y £ X, we have 
d x (gh -x,gh-y) < Hd x {x,y) ; 

(ii) for every h G E, every g G G with dc(g, 1g) — M, and every function G 
C 2m (X), we have \\(p g h\\c^(X) < H \\4>\\c 2 ™(X) where (j) gh (x) := cf)(gh ■ x). 

The distance on a nilmanifold Y G T' is not the distance induced by the inclusion in 
X. However, the inclusion Y C X is smooth and thus we can assume that 

(iii) for every nilmanifold Y G T' and every x,y G Y, we have dx(x, y) < Hdy(x, y); 

(iv) for every Y G J 7 ' and every function on X, we have \\(j>\ ||c 2m (Y) — -^ll^llc 2m (X)- 

By Lemma 15.51 for every X' G J 7 , every £ > 0, and every /i G E, there exists p := 
p(M, X, X' , /i, £) with the following property: 

(v) Let X' = G'/V G J~, h G S, ey := h ■ ex, and (fl ,/ (^))nG[A''] polynomial sequence 
in G' with degree at most 2; if (g' ' {n)ex) n &[N} ls totally p-equidistributed in X' , 
then (g'(n)eY)ne[N] 1S totally £-equidistributed in Y := G' ■ ey- 

6.2.3. Objects associated to 7i. We consider now all the nilmanifolds belonging to the 
family Ti and define the finite family of nilmanifolds 

LF'(M) = (J F'(M,X) 
xeH 
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and the positive numbers 

(41) H(M) : = max H(M,X) ; 

(42) P(M,Q:= min p(M, X, X' , h, (), 

X'£F(M,X), heS(M,X) 

where p(M, X, X' , h, £) was defined by item (jvj) above. 
We let 

(43) 8 1 {M) : tr 



(44) 9(M) : = min 



5M 2 ' 
5 5i{M) 



,2 ' 2 ci 

where <5 was defined by Section IFT21 and ci is the universal constant defined by Lemma [2 .61 
To every r > and every nilmanifold Y in the finite collection ^'(M), either Propo- 
sition 15,31 or Proposition 15.41 (applied for Y in place of X) associates a positive number 
o~(Y, t) (depending on whether Y in non-Abelian or Abelian). We define 

5i(M) 



(45) a(M) := min o(y , 

YeJ-'CAf) V 



YG.F'(M) V ' 10 H(M) 2 J' 

6.2.4. The function uj and bounds for M . We let u: N — > M + be the function defined by 

(46) uj{M) := p{M,a(M)) 
where p is defined in (|42p and <r is defined in (|45p . We also let 

(47) M :=r2/e]. 

For this choice of uj and Mo, Corollary 14.51 associates to every nilmanifold X £ H a, 
positive real number M\(Mq, X, oj) and we define 

(48) Mi := maxMi(M ,I,w). 
We stress that M\ depends only on e. 

6.3. Using the [/^-decomposition. After setting up the stage we are now ready to 
enter the main body of the proof of Theorem 11.61 
Let e > 0. Let Mi be given by (g8]) and let 

(49) 9 X := 0(Mi) 

be given by (|44p . Note that 6\ depends on e only. We use Theorem 13. II for 6 substituted 
with this "very small" value of 9\. We get that for every sufficiently large N, depending 
only on e, every x £ -M admits the decomposition 

Xn = XN,s + XN,u 

where XN,s an d XN,u satisfy the conclusions of Theorem I3.lt in particular 

( 50 ) WXNtuWu' 2 ^^) < ®1- 

We claim that XN s an d XN,u satisfy the conclusion of Theorem 16. II Theorem 13.11 gives 
at once that Properties (jT|) and (|n]) of Theorem 16.11 are satisfied with Q := Q(M\) and 
R := R(Mi). It remains to verify Property ([mj) . namely that 

IIXAT,«||t/3(%) < e - 
We argue by contradiction. We assume that 

( 51 ) WXN^Wu^iZfj) > e -> 



UNIFORMITY OF MULTIPLICATIVE FUNCTIONS AND PARTITION REGULARITY 



33 



and in the next subsections we are going to derive a contradiction. To facilitate reading 
we split the proof into several parts, and to ease notation, we continue to leave the 
dependence on e, I implicit. 

6.4. Using the inverse and the factorization theorem. Suppose that ([5T|) holds. 
We recall that in Section 16.21 we used Corollary 14.31 to define the positive integer m, 
the positive real number 5, and a finite family T~L of nilmanifolds with vertical torus of 
dimension 1. We recall also that these objects depend only on e and we assume that N 
is sufficiently large so that the conclusions of Corollary 14.31 hold. 

If the alternative ([!]) of Corollary 14.31 holds, then 

IIXJV,w||c/ 2 (%) > <5> 

and since by (JUJ) and flU we have 6 X = 9{M X ) < 5/2, this contradicts (f50|) . 

As a consequence, alternative ([n]) of Corollary 14.31 holds. Our goal is to show that for 
the particular choices made in the previous subsections we get again a contradiction. 

By our assumption, there exists a 2-step nilmanifold X = G/T belonging to the family 
T~L, a nilcharacter $onI with frequency 1, and an element g of G, such that 

(52) ||^|| C 2m( X ) < 1, and 

(53) \E ne[F j ]X N,u(^(9 n -ex)\>5 

where m is the dimension of X. Note that X, ty, and g, will depend on \i bvrt this is 
not going to create problems for us. 

Recall that for every M 6 N the finite family T{M, X) of sub-nilmanifolds of X was 
defined in Section 16.2.21 using Corollary 14.51 of the factorization theorem. Next we apply 
this corollary for the sequence (g n ) n£ ^ in G, the function oj: N — > M + defined by (|46|) . 
and the integer Mq defined by (|47p . For M\ given by ([48 P we get an integer M with 

(54) M < M < Mi, 

a nilmanifold X' = G' /V belonging to the family T(M, X) that satisfies either Prop- 
erty (jl]) or Property §u§ of Corollary 14. 5[ and a factorization 

(55) g n = e(n) 9 / (n) 7 (n) 

into polynomial sequences in G that satisfy Properties ([m)l . (jiv|) . (jvj) of Corollary 14.51 for 
this value of M. The number M depends on x, but it belongs on the interval [Mo, Mi], 
and since the integers Mo, Mi depend only on e, this suffices for our purposes. 
From this point on, we work with this choice of M. 

6.5. Eliminating e and 7 by passing to a sub-progression. Our goal on this and 
the next subsection is to get an estimate of the form (|53|) with the additional property 
that the sequence {g n ■ ex) ne ^ is sufficiently totally equidistributed. To achieve this we 
pass to an appropriate sub-progression where the sequences e and 7 defined by (|55|) are 
practically constant and then change the nilmanifold defining the nilsequence to eliminate 
some extra terms introduced. 

By Property (jvj) of Corollary 14. 5[ the sequence ( 7 (n)) is periodic of period at most M; 
we denote its period by p. Let 



(56) L :-- 



2M 2 



N 



We partition [N] into arithmetic progressions of step p and length L and a leftover set 
that we can ignore since it will introduce error terms bounded by a constant times S/M 
and thus negligible for our purposes (upon replacing 5 with 5/2 below). Using (|53p and 
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the pigeonhole principle, we get that there exists a progression P with step p and length 
L such that 

( 57 ) Ke[mMn)XNAnmg n -ex)\>si = ^-i. 

Let no,n £ P. We have 7(71) = 7(no) and by Property ([m|) of Corollary 14.51 the 
sequence (e(n)) is (M, iV)-smooth. Using these properties and the right invariance of the 
metric da, we get 

M M M 2 L 

dc(g n , e(n )#'(n)7(no)) < d G (e(n), e(n )) < |n - n \ < pL < 

N N N 

Since by ([52]) the function ^ has Lipschitz constant at most 1, it follows that 

M 2 L 



\®{g n ■ ex) ~ ^(e(n )g'{n)^(n ) ■ e x )\ < 

N 



From this and (1561) we deduce that 



L M 2 L 5 2 

\e[N] 1p ^\ XN,u( n )\ ■ ex) ~ *(e(noy(n)7(«o) • e x )\ < ^ -~- < jj^- 
Combining this with ([57p and the definition of 5i given by ([43 P we get 

5 2 

( 58 ) |E ne[ ^ ] lp(n)x7V, u (n)^(e(n )<7 / (n)7(no) ■ e x ) \ > ^ = *i(M) 
provided that N > 20M 1 2 /<5 (then 8/N < <5 2 /(20M 2 )). 

6.6. Changing the nilmanifold. By Property (jvj) of Corollary 14. 5 1 7(no) is M-rational. 
By the definition of S(M, X) in Section [6.2.21 we can choose Hq G S(M, X) such that 
7(^0) ' ex = ho ■ ex- We define 

ey :=h -ex ; Y := G" • ey = G'/^or/i^ 1 n G') ; 

= *(e(n ) • x). 

Note that the nilmanifold Y belongs to the family P'(M) defined in Section [6.2.31 For 
every n E N we have 

(59) *(e(n )</(n)7(n ) ■ ex) = *'{g'{n) ■ e Y ). 

Since the sequence ( e ( n )) n6 rj\n * s (-^j AO-smooth, we have d(y(e(rio), If?) 5; M. Fur- 
thermore, since || x I / ||c 2m (x) — 1; by Property (|n]) of Section [6.2.21 we have || x I' / ||c 2m (x) — 
H(M) where H(M) was defined by (|4ip . Since the nilmanifold 1" belongs to the family 
J-'(M), by Property (jivj) of Section [6.2.21 we have 

(60) l|*'l y llc^(Y) < H(M) 2 . 

Recall that the sequence (g'(n) ■ e x) ng ^] arises from the decomposition ([55]) provided 
by Corollary 14.51 by Property (jiv|) of this corollary, the sequence {g'{n) ■ e x) n( z^ is 
cj(M)-totally equidistributed in X' . 

Since by the definition of oo (see (06])) we have ui(M) = p(M,a(M)), Property (jvj) of 
Section 16X21 and (g2]) give that 

(61) the sequence (g (n) • ey) ng rjv] is totally a(M)-equidistributed in 1". 
Summarizing, we have so far established that 

(62) |E n6[ ^lp(n) XjV)U (n)*'( 5 >) ' e ^)l ^ *i( M ) 

for some arithmetic progression P C [iV] and properties (j60p and (j6ip are satisfied. In 
the next subsection we further reduce matters to the case where the function has 
integral zero. 
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6.7. Reducing to the zero integral case. Our goal is to show that upon replacing \&' 
with — z, where z = f Y ^'dmy, we get a bound similar to ([62]). To do this we make 
crucial use of the fact that the [7 2 -norm of XN,u is suitably small, in fact, this is the step 
that determined our choice of the degree of L r2 -uniformity 9\ of XN,u- We write 



J *' dmy and *" = 



Combining Lemma 12.61 equations f|44|) . (|49p . and estimate (|50p we get 

|E n6[ ^l P (n)zxJV,«(n)| < ci||xjv, u ||c/2 (%) < c^i = Cl 0(Mi) < -ft (Mi) < -tfi(M) 

where the last estimate follows from (|43p and the fact that M < Mi . From this estimate 
and (1621) we deduce that 



(63) \E ne[ ^lp(n)xN,u( n )^"(9'(n) ■ e Y )\ > ^i(M) 
where 

(64) \\^"\ Y \\ C 2 m( Y) < H(M) 2 and / ^"dm y =0. 
We recall that Property (|61J) is also satisfied. 

6.8. Proof of the weak [/^-decomposition result. We are now very close to com- 
pleting the proof of Theorem 16.11 To this end, we are going to combine (|63p . the total 
equidistribution of the sequence (g'(n) -ey) ng ^p and Propositions 15.31 and 15.41 to deduce 
a contradiction. 

Recall that XN,s = XN * 4> (the convolution is taken in Z«) where is a kernel in Zk, 
meaning a non-negative function with E ng z~</>(n) = 1. Since x« = XJV ~~ Xs, we c & n write 
XiV,u = XN * ipi where ip is a function on with E ne z^ |VK n )l — 2- We deduce from 
that there exists an integer q with < q < N such that 



(65) \E ne[Sj] l P (n + q mod JV)xjv(n)tf"( 5 '(™ + Q mod iV) • ey)| > - 5\(M) 

where the residue class n + q mod N is taken in [N] instead of the more usual interval 
[0,iV). It follows that 

\E ne[ ^l P (n + m) lj(n) l [N] {n) X (n)^"{g'(n + m) ■ e Y )\ > - S^M) 

where either J is the interval [N — q] and m = q, or J is the interval (TV — q, N] and 
m = q — N . We remark that in both cases lp(n + m) lj(n) l[jyi(n) = lp 1 (n) for some 
arithmetic progression Pi C [N] and thus we have 

(66) \E nm l Pl (n)x(n)*"(g'(n + m)-e Y )\ > ^i(M). 

Recall that || x I / "| y ||c 2m (Y) — H(M) 2 . By (|6ip . the sequence (g'(n) ■ ey) ne rjyi is to- 
tally cr(M)-equidistributed in Y. We remark that, depending on the case (jn|) or (jl]) of 
Corollary 14.51 Proposition 15.31 or Proposition 15.41 correspondingly can be applied to the 
nilmanifold Y and the sequence (</(n)) ne ^. Since by the definition of a(M) (see (|45|) ) 
we have 

a(M)<a(Y, * l(M) 



10#(M) 2 /' 
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by using one of the two propositions we deduce that 

KelN] 1 ?^ (n)x(n)t>'(g'(n + m) ■ ey)\ < 

\E ne[N] l Pl (n) X (n)V(g'(n + m)-e Y )\ < J^)2 W*"\ Y \\c*™<Y) < 

which contradicts ([65]) . Hence, ()5ip fails, giving us that llXjV.ullt^fZ^) — e - This com- 
pletes the proof of Theorem 16,11 □ 

6.9. Proof of the strong [/^-decomposition on the average. In this subsection we 
prove Theorem 1 1.61 Our basic ingredient is the weak decomposition result of Theorem l6.11 
We use it in an iterative way in an argument of energy increment; this is made possible 
because of the simple structure of XN,s and the particular form of the kernels <f>N,9 that 
occur in Theorem 16.11 We recall that these kernels were defined by (|23p in Section 13.31 
the most important property used in the subsequent argument is the monotonicity of 
their Fourier coefficients: 

if < 0' < Q, then ^J(f) > <faw(£) > for every f G Z#. 

We fix a function F:NxNx R + — > M + , an e > 0, and a finite positive measure v 
on the compact group Ai of multiplicative functions. We can assume without loss of 
generality that v is a probability measure. 

We define inductively a sequence (Qj) of positive reals and sequences (Nj), (Qj), (Rj) 
of positive integers as follows. We let Q\ = N\ = Q\ = R\ = 1. Suppose that j > 1 and 
that the first j terms of the sequences are defined. We apply Theorem 16.11 with 

1 

F(Q~Rj,e) 

Let Nj + i be such that the conclusions of Theorem 16.11 hold with these data as input for 
every N > Nj + \. We get positive integers Q and R that we write respectively as Qj+i 
and Rj+i, and a real Qj+\ with < Oj+i < Qj, such that for every N > Nj+i and every 
X G Ai, the functions 

Xj+i,N,s '■= Xn * 4>N,e J+1 and Xj+i,N,u ■= Xn - Xj+i,N,s 
satisfy the Properties dTTJ) and (|ul|) of Theorem 16. 1[ i.e., 

R ■ 

( 67 ) \xj+i,N, s ( n + Qj+i) - Xj+i,N, s ( n )\ ^ -^p for ever y " eZ j/i 

(68) \\x j+ i,nAu H ^) < Flg-R-Jy 

Replacing Nj+i with maxj<, N we can assume that the sequence (Nj) is increasing. By 
construction, the sequence (Qj) is decreasing. 

Let J = 1 + [2e -2 ] and let N Q = N J+V For every N > N Q we have 



Qj substituted for Qq and tti//0 — — — — substituted for e. 



E / \\Xj+l,N,a ~ Xj,N,s\\h(Z =) MX) = 
j=2 JM 

r J 



/A1 i=2 



J 
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where to get the last estimate we used that #jv,j+i < &N,j an d thus (f>N.e j+1 (0 > 

4>N,9j(0 > for every £ by (fT6|) . Since |0jv,0(£)l — 1> the l as t quantity in the esti- 
mate is at most 




Therefore, for every N > Nq there exists jo := jo(F, N, e, v) with 

(69) 2 < j < J 
such that 

(70) / ||Xio+i,JV, 
JAW 

For A" > A^o, we define 

iPn,i ■= <t>N,$ jo ; *I>n,i ■■= 4>N,0 jo+1 ; 

XN,s ■= XN * 1pN,l = Xjo,N,s ; XN,u ■= XN ~XN* 1pN,2 = Xjo+l,N,u ] 
XN,e ■= XN * (lpN,2 ~ 1pN,l) = Xjo+l,N,s ~ Xjo,N,s \ 

Q := Qj and R := Rj . 
Then we have the decomposition 

XN = XN,s + XN,u + XN,e, 

and furthermore, Property (juj) of Theorem ll,6l follows from (j67|) (applied for j = jo — 1), 
Property (|m|) follows from (|68p (applied for j = jo), and Property ([rvjl follows from (j70|) 
and the Cauchy-Schwarz estimate. Furthermore, it follows from (|69p that the integers 
Nq,Q,R are bounded by a constant that depends on F and e only. Thus, all the an- 
nounced properties are satisfied, completing the proof of Theorem 11.61 □ 

Appendix A. Rational elements in a nilmanifold 

In this section, working with 2-step nilmanifolds does not provide any simplification 
and so the results are stated for general nilmanifolds. 

Let X = G/Y be an s-step nilmanifold of dimension m. As everywhere in this article 
we assume that G is connected and simply connected, and endowed with a Mal'cev basis. 
Recall that we write ex for the image in X of the unit element \q of G. 

From Property ([ml) of Mal'cev bases stated in Section 14.11 we immediately deduce: 

Lemma A.l. V is a finitely generated group. 

A.l. Rational elements. We recall that an element g G G is Q-rational if g m 6 T for 
some m G N with m < Q. We collect here some properties of rational elements. We note 
that all quantities introduced below depend implicitly on the nilmanifold X. 

Lemma A. 2 ([17, Lemma A. 11]). (i) For every Q G N there exists Q\ G N such 

that the product of any two Q-rational elements is Q\-rational; it follows that 
the set of rational elements is a subgroup ofG. 

(ii) For every Q G N there exists q G N such that the Mal'cev coordinates of any 
Q-rational element are rational with denominators at most q; it follows that the 
set of Q-rational elements is a discrete subset of G. 

(iii) Conversely, for every q G N there exists Q G N such that, if the Mal'cev coordi- 
nates of g G G are rational with denominators at most q, then g is Q-rational. 

Corollary A. 3. For every Q G N there exists a finite set S := X(Q) of Q-rational 
elements such that all Q-rational elements belong to Ti{Q)T. 



Xj ,N r 



du{x) < 



J -I 



< e z 
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Proof. Let K be a compact subset of G such that G = KT. 

Let Q G N. Let Qi be associated to Q by Part ([!]) of Lemma lA.21 and let Si be the 
set of Qi-rational elements of K. By Part (jn]) of Lemma IA.21 Si is finite. Let g be a 
Q-rational element of G. There exists 7 G T such that g7 _1 G K. Since 7 is obviously Q- 
rational, g^~ l is Qi-rational and thus belongs to £1. For each element h of Ei obtained 
this way we choose a Q-rational point g such that h G gT. Let £ := S(Q) be the set 
consisting of all elements obtained this way. We have that XT contains all Q-rational 
elements. Furthermore, |S| < |Si| and so £ is finite, completing the proof. □ 

A. 2. Rational subgroups. We gather here some basic properties of rational subgroups 
that we use in the main part of the article. 

Definition. A rational subgroup G' of G is a closed and connected subgroup of G such 
that its Lie algebra q' admits a base that has rational coordinates in the Mal'cev basis 
of G. 

An equivalent definition is that F := Tn G 1 is co-compact in G. In this case, G' /V is 
called a sub-nilmanifold of X. 

Lemma A. 4 ([IT], Lemma A. 13). If G 1 is a rational subgroup of G and h is a rational 
element, then hG'h~ l is a rational subgroup of G. 

Proof. The conjugacy map h 1— > g~~ 1 hg is a polynomial map with rational coefficients and 
thus the linear map Ad^ from q to itself has rational coefficients. Since g' has a base 
consisting of vectors with rational coefficients, the same property holds for Ad^Q, that 
is, for the Lie algebra of hG'h^ 1 . This proves the claim. □ 

The argument used to deduce Lemma lA . 1 1 shows that the group T n hG'h^ 1 is finitely 
generated. 

Lemma A. 5. Let X = G/T be an s-step nilmanifold, G' C G be a rational subgroup, 
and g G G be a rational element. Then 

(i) r n g~ 1 Tg flG' is a subgroup of finite index ofTd G' ; 

(ii) r n g~ 1 Tg C\G' is a subgroup of finite index of g~ l Yg n G 1 . 

Proof. By Part ([!]) of Lemma [A. 21 all elements oi gTg^ are rational. Hence, if 7 G TClG' 
there exists k G N with {g^g~ l ) k G T and so we have 7^ G g~ l Tg flTn G' . Applying 
Lemma TA-ll to G' and T(~)G' we get that this last group is finitely generated. By induction 
on s it is easy to deduce that g~ 1 Tg nTnG' has finite index in T n G' . This proves 
Since ^r^ -1 is co-compact in G, substituting this group for G and g^ 1 for g in the 
preceding statement, we get (|n|). □ 

Lemma A. 6. Let g G G be a rational element and G' a rational subgroup of G. Then 
G'g ■ ex '■= {hg - ex - h G G'} is a closed sub-nilmanifold of X. 

Proof. By Lemma |A.4| g~ 1 G'g is a rational subgroup of G. Therefore, T n g~ 1 G'g is 
co-compact in g~ 1 G'g and thus ffFc/ -1 n G' is co-compact in G' . But gTg^ 1 n G' is the 
stabilizer {h G G' : h ■ g ■ ex = g • &x} of g ■ ex in G' and thus the orbit G' ■ (g ■ ex) is 
compact and can be identified with the nilmanifold G' /{gTg~ l n G'). □ 

Appendix B. Solution sets related to some homogeneous quadratic forms 

We give a proof of Proposition 11.41 from the introductory section. 

Proposition B.l. Let the quadratic form p satisfy the hypothesis of Theorem \l.l[ Then 
there exist positive and £2, £3 non-negative integers with £2 7^ £3, such that for every 
k,m,n G N, the integers x = k£om(m + £\n) and y = k£o(m + £2n)(m + £%ri) satisfy the 
equation p(x, y, z) = some z G N. 
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Proof. Let 

(71) ax 2 + by 2 + cz 2 + dxy + exz + fyz = 

be the equation we are interested in solving. Recall that by assumption a, b > and 
c < 0, and that all three integers 

At := e 2 - 4ac, A 2 := f 2 - Abe, A 3 := (e + f) 2 - 4c(a + b + d) 

are non-zero squares. 

A direct computation shows that if (xo^Cb^o) ^ s a solution of (|7ip . then also the 
following is a solution 

x =k(—(axQ + dyo + ezo)m 2 — (2byo + fzo)mn + bx^n 2 ) 

y =k(ayom 2 - (2ax + ez )mn - (by + dx + fz )n 2 ) 

z =kzo(am 2 + dmn + bn 2 ) 

where k, m, n, £ Z@ 

The discriminant of the quadratic form 

Qi(m,n) := —(axo + dyo + ezo)m 2 — (2byo + fzo)mn + bxon 2 

turns out to be z 2 /S.2 which is a square since by assumption A 2 is a square. Hence, 
Qi(m,n) factors into a product of linear forms with rational coefficients. Similarly, the 
discriminant of the quadratic form 

Q2(m, n) := ay m 2 - (2ax + ez )mn - (by + dx + fz )n 2 

turns out to be z 2 A\ which is a square since by assumption Ai is a square. Hence, 
Q2(m,n) factors into a product of linear forms with rational coefficients. 

The assumption that A3 is a square is used to guarantee that a non-trivial choice of 
Xo, 2/0) z can be made so that the coefficients of m 2 in the quadratic forms Qi(m, n) and 
Q2(m,n) are equal, i.e. — (axQ + dyo + ezo) = ayo is satisfied. Indeed, if we multiply 
equation (171]) by e 2 and insert — (axo + (d + a)yo) in place of ezo, we are lead to the 
equation 

a cx + a(2cd + 2ac — e — e/)xo?/o + (£>e + c<i + a c + 2ac<i — def — aef)y = 0. 

A direct computation shows that its discriminant is a 2 e 2 A3, which is square since by 
assumption A3 is a square. This leads to the following solution of (|7ip 

xo = 2ac + 2cd — e 2 — ef + ey^K^ 
y = -2ac 

zq = a(e + / + \/A7) 

which is non-trivial since yo > 0. We work with these choices of XQ,yo, zq next. 

Combining the above, we deduce that under the stated assumptions on a, b, c, there 
exist 1%, . . . , Is £ Z, with /1/3 = /5Z7 7^ 0, such that for every k,m,n £ 7L the integers 

x = k(lim + l2n)(l?,m + l^n), 

y = k{l^m + lQn)(lim + Zgn) 

satisfy equation (|7ip for some z := z m ^ n £ N. Inserting lil^l^ljn in place of n, we get 
that there exist l[, . . . , l'^ £ Z, such that for every k, m, n £ Z the integers 

x = k£o(m + / / 1 n)(m + £ 2 re )> 

y = k£o(m + l' 3 n)(m + /^n), 

where ^0 = IW3I = |^5^7| 7^ 0, satisfy equation ()7ip for some := G N. 

^One finds these values by looking for solutions of the form x — txo + m, y = tyo + n, z = tzo, which 
leads to the choice t = —(am 2 + dmn + bn 2 ) / ((2axo + rfj/o + ezg)m + (2byo + dxo + fzo)n). 
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Since the quadratic forms Q\ and Q2 have non-zero discriminant, we have l[ 7^ 1' 2 and 
I3 / IJ. Without loss of generality we can assume that l[ < l[ for i = 2,3,4. Inserting 
m — l[n in place of m, we get that for every k, m, n E Z the integers 

x = k£om(m + (1' 2 — l[)n), 

y = k£ (m + - /' 1 )n)(m + (l' 4 - l[)n) 

satisfy equation (I7TT) for some z" := z' 7 ' nn £ N. Letting £\ = l 2 — l[, £2 = I3 — 1±, ^3 = I4 — l\i 
we get the asserted conclusion with z!^ n in place of z. □ 

Alternatively, a proof that is free of computations can be given using the fact that the 
discriminant of the forms p(x, 0, z), p(0, y, z), p(x, x, z) is a non-zero square. We chose a 
more hands on argument since it determines the integers £\, £ 2 , £3 explicitly. 
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